Herbicide tolerant cotton plants and methods for producing and identifying same

ABSTRACT

The invention pertains to transgenic cotton plants, plant material and seeds, characterized by harboring a specific transformation event, particularly by the presence of a gene encoding a protein that confers herbicide tolerance, at a specific location in the cotton genome. The cotton plants of the invention combine the herbicide tolerant phenotype with optimal agronomic performance.

FIELD OF THE INVENTION

[0001] This invention pertains to transgenic cotton plants, plant material and seeds, characterized by harboring a specific transformation event, particularly by the presence of a gene encoding a protein that confers herbicide tolerance, at a specific location in the cotton genome. The cotton plants of the invention combine the herbicide tolerant phenotype with an agronomic performance, genetic stability and adaptability to different genetic backgrounds equivalent to the non-transformed cotton line in the absence of weed pressure.

[0002] All documents cited herein are hereby incorporated herein by reference.

BACKGROUND OF THE INVENTION

[0003] The phenotypic expression of a transgene in a plant is determined both by the structure of the gene itself and by its location in the plant genome. At the same time the presence of the transgene at different locations in the genome will influence the overall phenotype of the plant in different ways. The agronomically or industrially successful introduction of a commercially interesting trait in a plant by genetic manipulation can be a lengthy procedure dependent on different factors. The actual transformation and regeneration of genetically transformed plants are only the first in a series of selection steps, which include extensive genetic characterization, breeding, and evaluation in field trials.

[0004] Cotton fiber is the single most important textile worldwide. About 80 million acres of cotton are harvested annually across the globe. Cotton is the fifth largest crop in the U.S. in terms of acreage production, with over 15 million acres planted in 2000. Primary weed species for cotton are Ipomoea sp. (morning glory), Amaranthus spp. (pigweed), Cyperus spp. (nutsedge), Xanthium spp. (cocklebur) and Sorghum spp. (johnsongrass). Before the introduction of broad-leaf herbicides that could be used on a growing cotton field, growers used directed, post-emergence applications of nonselective herbicides taking care not to contact the growing crop plants. As this requires a difference in height between the weeds and the crop, this is not always possible. Especially for small cotton, this practice is time-consuming and potentially damaging to the crop.

[0005] The bar gene (Thompson et al, 1987, EMBO J 6:2519-2523; Deblock et al. 1987, EMBO J. 6:2513-2518) is a gene encoding the enzyme phosphinothricin acetyl transferase (PAT), which, when expressed in a plant, confers resistance to the herbicidal compounds phosphinothricin (also called glufosinate) or bialaphos (see also for example U.S. Pat. Nos. 5,646,024 and 5,561,236) and salts and optical isomers thereof. Phosphinothricin controls broadleaf weeds including morning glory and has a wide window of application.

[0006] Successful genetic transformation of cotton has been obtained by a number of methods including Agrobacterium infection of cotton explants (Firoozabady et al. 1987, Plant Molecular Biology 10:105-116; Umbeck et al. 1987, Bio/Technology 5:263-266 and in WO 00/71733, U.S. Pat. No. 5,004,863, and U.S. Pat. No. 5,159,135), as well as direct gene transfer by microprojectile bombardment of meristematic cotton tissues (Finer and Mc Mullen, 1990, Plant Cell Reports, 5:586-589; McCabe and Martinell, 1993, Bio/Technology 11:596-598, WO92/15675, EP0 531 506). Increased transformation efficiency for Agrobacterium transformation has been reported using the methods described by Hansen et al. (1994, Proc. Nat. Acad. Sci. 91:7603-7607) Veluthambi et al. (1989, Journal of Bacteriology 171:3696-3703) and WO 00/71733.

[0007] Different methods for regeneration of cotton plants have also been described (WO 89/05344, U.S. Pat. No. 5,244,802, U.S. Pat. No. 5,583,036, WO89/12102, WO98/15622, and WO97/12512).

[0008] However, the foregoing documents fail to teach or suggest the present invention.

SUMMARY OF THE INVENTION

[0009] The present invention relates to a transgenic cotton plant, or seed, cells or tissues thereof, comprising, stably integrated into its genome, an expression cassette which comprises a herbicide tolerance gene comprising the coding sequence of the bar gene (as described in Example 1.1 herein), which is herbicide tolerant and, in the absence of weed pressure, has an agronomic performance which is substantially equivalent to the non-transgenic isoline. Under weed pressure and the appropriate Liberty™ treatment, the plant will have a superior agronomic phenotype compared to the non-transgenic plant.

[0010] In one embodiment of the invention, the cotton plant or seed, cells or tissues thereof, comprises the expression cassette of pGSV71 (as described in Example 1.1, Table 1 herein). In the preferred embodiment of the invention the cotton plant or seed, cells or tissues thereof comprise elite event EE-GH1.

[0011] In another embodiment of the invention, the transgenic cotton plant or seed, cells or tissues thereof comprises:

[0012] (i) event EE-GH1 in its genome ;or

[0013] (ii) event EE-GH1 with the proviso that the bar gene used in the event is substituted with a nucleic acid sequence that hybridizes to the complement of the bar gene under stringent conditions.

[0014] More specifically, the present invention relates to a transgenic cotton plant, seed, cells or tissues thereof, the genomic DNA of which is characterized by the fact that, when analyzed in a PCR identification protocol as described herein, using two primers directed to the 5′ or 3′ flanking region of EE-GH1 and the foreign DNA, respectively, yields a fragment which is specific for EE-GH1. Preferably the primers are directed against the 5′ flanking region within SEQ ID NO: 1 and the foreign DNA respectively; most preferably, the primers comprise the nucleotide sequence of SEQ ID NO: 2 and SEQ ID NO: 3 respectively, and yield a DNA fragment of between 250 and 290 bp, preferably of about 269 bp.

[0015] Reference seed comprising the elite event of the invention has been deposited at the ATCC under accession number PTA-3343. Thus, a preferred embodiment of the invention is the seed comprising elite event EE-GH1 deposited as ATTC accession number PTA-3343, which will grow into a cotton plant resistant to glufosinate. The seed of ATCC deposit number PTA-3343, which is a seed lot consisting of about 50% non-transgenic kernels and 50% transgenic kernels hemizygous for the transgene, comprising the elite event of the invention, which will grow into glufosinate tolerant plants. The seed can be sown and the growing plants can be treated with PPT or Liberty™ as described herein to obtain 100% glufosinate tolerant plants, comprising the elite event of the invention. The invention further relates to cells, tissues, progeny, and descendants from a plant comprising the elite event of the invention grown from the seed deposited at the ATCC having accession number PTA-3343. The invention further relates to plants obtainable by propagation of and/or breeding with a cotton plant comprising the elite event of the invention grown from the seed deposited at the ATCC having accession number PTA-3343.

[0016] The invention further relates to plants, seeds, cells or tissues comprising a foreign DNA sequence, preferably a herbicide tolerance gene as described herein, integrated into the chromosomal DNA in a region which comprises the plant DNA sequence of SEQ ID NO: 1 and/or SEQ ID NO: 4, more particularly which comprises the DNA sequence of SEQ ID NO: 5, or a sequence which hybridizes under stringent conditions to a sequence that is complementary to a sequence comprising the plant DNA sequence of SEQ ID NO: 1, SEQ ID NO: 4 and/or SEQ ID NO: 5.

[0017] The invention further provides a process for producing a transgenic cell of a cotton plant, which comprises inserting a recombinant DNA molecule into a region of the chromosomal DNA of a cotton cell, tissue or callus which comprises the plant DNA sequence of SEQ ID NO: 1 and/or SEQ ID NO: 4, more particularly which comprises the DNA sequence of SEQ ID NO: 5, or which comprises a sequence which hybridizes under stringent conditions to a sequence that is complementary to a sequence comprising the plant DNA sequence of SEQ ID NO: 1, SEQ ID NO: 4 and/or SEQ ID NO: 5.

[0018] The invention further relates to a method for identifying a transgenic plant, or cells or tissues thereof, comprising elite event EE-GH1 which method is based on identifying the presence of characterizing DNA sequences or amino acids encoded by such DNA sequences in the transgenic plant, cells or tissues.

[0019] According to one preferred aspect of the invention, the method for identifying a transgenic plant, or cells or tissues thereof, comprising elite event EE-GH1, comprises amplifying a sequence of a nucleic acid present in biological samples, using a polymerase chain reaction, with at least two primers, one of which recognizes the plant DNA in the 5′ or 3′ flanking region of EE-GH1, the other which recognizes a sequence within the foreign DNA. Preferably, the genomic DNA is analyzed using primers which recognize a sequence within the plant 5′ flanking region of EE-GH1, most preferably within the plant DNA sequence in SEQ ID NO: 1, and a sequence within the foreign DNA, respectively. Especially preferably, the genomic DNA is analyzed according to the PCR identification protocol described herein whereby the primer recognizing a sequence within the 5′ flanking region comprises the nucleotide sequence of SEQ ID NO: 2.

[0020] Particularly, the primer recognizing a sequence within the 5′ flanking region comprises the nucleotide sequence of SEQ ID NO: 2 and the primer recognizing a sequence within the foreign DNA comprises the nucleotide sequence of SEQ ID NO: 3, so that the amplified fragment is a fragment preferably of between 250 and 290 bp, preferably of about 269 bp. Accordingly, the present invention relates to the transgenic plant, cells or tissues thereof which can be identified according the above-described identification method for EE-GH1.

[0021] The present invention relates to methods for identifying elite event EE-GH1 in biological samples, which methods are based on primers or probes that specifically recognize the 5′ and/or 3′ flanking sequence of EE-GH1. In a preferred embodiment of the invention these methods are based on primers or probes which recognize a sequence within SEQ ID NO: 1 and/or SEQ ID NO: 4, more particularly primers or probes comprising the sequence of SEQ ID NO: 2.

[0022] The present invention further relates to the specific flanking sequences of EE-GH1 described herein, which can be used to develop specific identification methods for EE-GH1 in biological samples. More particularly, the invention relates to the 5′ and or 3′ flanking regions of EE-GH1, which can be used for the development of specific primers and probes as well as to the specific primers and probes developed from the 5′ and/or 3′ flanking sequences of EE-GH1. The invention further relates to identification methods for the presence of EE-GH1 in biological samples based on the use of such specific primers or probes.

[0023] The invention thus also relates to a kit for identifying elite event EE-GH1 in biological samples, the kit comprising at least one primer or probe which specifically recognizes the 5′ or 3′ flanking region of EE-GH1.

[0024] The invention also relates to a kit for identifying elite event EE-GH1 in biological samples, which kit comprises at least one specific primer or probe having a sequence which corresponds (or is complementary to) a sequence that hybridizes under stringent conditions to a specific region of EE-GH1. Preferably the sequence of the probe corresponds to a specific region comprising part of the 5′ or 3′ flanking region of EE-GH1. Most preferably the specific probe has (or is complementary to) a sequence that hybridizes under stringent conditions to the plant DNA sequence within SEQ ID NO: 1 or SEQ ID NO: 4.

[0025] Preferably the kit of the invention comprises, in addition to a primer which specifically recognizes the 5′ or 3′ flanking region of EE-GH1, a second primer which specifically recognizes a sequence within the foreign DNA of EE-GH1, for use in a PCR identification protocol. Preferably, the kit of the invention comprises two (or more) specific primers, one of which recognizes a sequence within the 5′ flanking region of EE-GH1, most preferably a sequence within the plant DNA region of SEQ ID NO: 1, and an other which recognizes a sequence within the foreign DNA. Especially preferably, the primer recognizing the plant DNA sequence within 5′ flanking region comprises the nucleotide sequence of SEQ ID NO: 2. Particularly, the primer recognizing the plant DNA sequence within 5′ flanking region comprises the nucleotide sequence of SEQ ID NO: 2 and the primer recognizing the foreign DNA comprises the nucleotide sequence of SEQ ID NO: 1 described herein.

[0026] The methods and kits encompassed by the present invention can be used for different purposes such as, but not limited to the following: to identify EE-GH1 in plants, plant material or in products such as, but not limited to food or feed products (fresh or processed) comprising or derived from plant material; additionally or alternatively, the methods and kits of the present invention can be used to identify transgenic plant material for purposes of segregation between transgenic and non-transgenic material; additionally or alternatively, the methods and kits of the present invention can be used to determine the quality (i.e. percentage pure material) of plant material comprising EE-GH1.

[0027] The present invention further relates to a method for tracking plants comprising elite event EE-GH1 in their genome upon introduction into different cultivars.

[0028] It will be understood that particular embodiments of the invention are described by the dependent claims cited herein.

BRIEF DESCRIPTION OF THE FIGURE

[0029] The following detailed description, given by way of example, but not intended to limit the invention to specific embodiments described, may be understood in conjunction with the accompanying Figure, incorporated herein by reference, in which:

[0030]FIG. 1. PCR analysis of other events and elite event EE-GH1 using the EE-GH1 PCR identification protocol. Loading sequence of the gel: lane 1, molecular weight marker (100 bp ladder), lane 2, DNA sample from a cotton plant comprising the transgenic event EE-GH1, lane 3, DNA samples from a cotton plant comprising another transgenic event, lane 4, DNA from wild-type cotton, lane 5, wild-type+1 copy of the pGSV71-BamHI digest (positive control), lane 6, negative control (no template), lane 8, molecular weight marker (100 bp ladder).

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT

[0031] The term “gene” as used herein refers to any DNA sequence comprising several operably linked DNA fragments such as a promoter region, a 5′ untranslated region (the 5′UTR), a coding region (which may or may not code for a protein), and an untranslated 3′ region (3′UTR) comprising a polyadenylation site. Typically in plant cells, the 5′UTR, the coding region and the 3′UTR are transcribed into an RNA of which, in the case of a protein encoding gene, the coding region is translated into a protein. A gene may include additional DNA fragments such as, for example, introns. As used herein, a genetic locus is the position of a given gene in the genome of a plant.

[0032] The term “chimeric” when referring to a gene or DNA sequence is used to refer to the fact that the gene or DNA sequence comprises at least two functionally relevant DNA fragments (such as promoter, 5′UTR, coding region, 3′UTR, intron) that are not naturally associated with each other and/or originate, for example, from different sources. “Foreign” referring to a gene or DNA sequence with respect to a plant species is used to indicate that the gene or DNA sequence is not naturally found in that plant species, or is not naturally found in that genetic locus in that plant species. The term “foreign DNA” will be used herein to refer to a DNA sequence as it has incorporated into the genome of a plant as a result of transformation. The “transforming DNA” as used herein refers to a recombinant DNA molecule used for transformation. The transforming DNA usually comprises at least one “gene of interest” (e.g. a chimeric gene) which is capable of conferring one or more specific characteristics to the transformed plant. The term “recombinant DNA molecule” is used to exemplify and thus can include an isolated nucleic acid molecule which can be DNA and which can be obtained through recombinant or other procedures.

[0033] As used herein the term “transgene” refers to a gene of interest as incorporated in the genome of a plant. A “transgenic plant” refers to a plant comprising at least one transgene in the genome of all of its cells.

[0034] The foreign DNA present in the plants of the present invention will preferably comprise a herbicide tolerance gene, more specifically a 35S-bar gene as the gene of interest.

[0035] A “herbicide tolerance” gene as used herein refers to a gene that renders the plant tolerant to a herbicide. An example of a herbicide tolerance gene is gene comprising a sequence encoding the enzyme phosphinothricin acetyl transferase, which detoxifies phosphinothricin, under the control of a constitutive promoter. More specifically, in the elite event of the present invention the herbicide tolerance gene comprises the coding sequence of the bialaphos resistance gene (bar) of Streptomyces hygroscopicus (Thompson et al. (1987) EMBO J 6: 2519-2523) under control of the 35S promoter from Cauliflower Mosaic Virus (Odell et al., (1985), Nature 313: 810-812), also referred to as “35S-bar” herein. The expression of the 35S-bar gene confers tolerance to herbicidal compounds phosphinothricin or bialaphos or glufosinate, or more generally, glutamine synthase inhibitors, or salts or optical isomers thereof which will generally be referred to as “glufosinate tolerance” herein.

[0036] By hybridizing under “stringent conditions” is meant the conventional hybridizing conditions as described by Sambrook et al. (1989) (Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbour Laboratory Press, NY) which for instance can comprise the following steps: 1) immobilizing plant genomic DNA fragments on a filter, 2) prehybridizing the filter for 1 to 2 hours at 42° C. in 50% formamide, 5×SSPE, 2× Denhardt's reagent and 0.1% SDS, or for 1 to 2 hours at 68° C. in 6×SSC, 2× Denhardt's reagent and 0.1% SDS, 3) adding the hybridization probe which has been labeled, 4) incubating for 16 to 24 hours, 5) washing the filter for 20 min. at room temperature in 1×SSC, 0.1%SDS, 6) washing the filter three times for 20 min. each at 68° C. in 0.2 ×SSC, 0.1% SDS, and 7) exposing the filter for 24 to 48 hours to X-ray film at −70° C. with an intensifying screen.

[0037] The incorporation of a recombinant DNA molecule in the plant genome typically results from transformation of a cell, tissue or callus (or from another genetic manipulation). The particular site of incorporation is either random or is at a predetermined location (if a process of targeted integration is used).

[0038] The DNA introduced into the plant genome as a result of transformation of a plant cell or tissue with a recombinant DNA or “transforming DNA” is hereinafter referred to as “foreign DNA” comprising one or more “transgenes”. Thus, foreign DNA may comprise both recombinant DNA as well as newly introduced, rearranged DNA of the plant. However, the term “plant DNA” in the context of the present invention will refer to DNA of the plant which is generally found in the same genetic locus in the corresponding wild-type plant.

[0039] The foreign DNA can be characterized by the location and the configuration at the site of incorporation of the recombinant DNA molecule in the plant genome. The site in the plant genome where a recombinant DNA has been inserted is also referred to as the “insertion site” or “target site”. Insertion of the recombinant DNA into the plant genome can be associated with a deletion of plant DNA, referred to as “target site deletion”. A “flanking region” or “flanking sequence” as used herein refers to a sequence of at least 20 bp, preferably at least 50 bp, and up to 5000 bp of the plant genome which is located either immediately upstream of and contiguous with or immediately downstream of and contiguous with the foreign DNA. Transformation procedures leading to random integration of the foreign DNA will result in transformants with different flanking regions, which are characteristic and unique for each transformant. When the recombinant DNA present in a transgenic plant is introduced into a different plant through traditional crossing, its insertion site in the plant genome, or its flanking regions will generally not be changed (apart from occasional changes due to mutations or cross-over and transposons). An “insertion region” as used herein refers to the region corresponding to the region of at least 40 bp, preferably at least 100 bp, and up to more than 10000 bp, encompassed by the sequence which comprises the upstream and/or the downstream flanking region of a foreign DNA in the (untransformed) plant genome (and including the insertion site and possible target site deletion). Taking into consideration minor differences due to mutations within a species, an insertion region will retain at least 85%, preferably 90%, more preferably 95%, and most preferably 100% sequence identity with the sequence comprising the upstream and downstream flanking regions of the foreign DNA in a given plant of that species.

[0040] Expression of a gene of interest refers to the fact that the gene confers on the plant one or more phenotypic traits (e.g. herbicide tolerance) that were intended to be conferred by the introduction of the recombinant DNA molecule—the transforming DNA—used during transformation (on the basis of the structure and function of part or all of the gene(s) of interest).

[0041] An “event” is defined as a (artificial) genetic locus that, as a result of genetic engineering, carries a foreign DNA comprising at least one copy of the gene(s) of interest (also referred to as a transformation event). An event is characterized phenotypically by the expression of the transgenes. At the genetic level, an event is part of the genetic makeup of a plant. At the molecular level, an event is characterized by the restriction map (e.g. as determined by Southern blotting) and/or by the upstream and/or downstream flanking sequences of the foreign DNA, and/or the molecular configuration of the foreign DNA comprising the transgenes. Usually when transforming a plant cell, tissue or callus with a transforming DNA, a multitude of events are generated, each of which is unique.

[0042] An “elite event”, as used herein, is an event which is selected from a group of events, obtained by transformation with the same transforming DNA or by back-crossing with plants obtained by such transformation, based on the phenotypic expression and stability of the transgenes and the absence of negative impact on the agronomic characteristics of the plant comprising it (i.e., selected transformation event). Thus the criteria for elite event selection are one or more, preferably two or more, advantageously all of the following:

[0043] a) That the presence of the foreign DNA in the plant does not compromise other desired characteristics of the plant, such as those relating to agronomic performance or commercial value;

[0044] b) That the event is characterized by a well defined molecular configuration which is stably inherited and for which appropriate diagnostic tools for identity control can be developed;

[0045] c) That the gene(s) of interest show(s) an appropriate and stable spatial and temporal phenotypic expression in homozygous condition of the event, at a commercially acceptable level in a range of environmental conditions in which the plants carrying the event are likely to be exposed in normal agronomic use.

[0046] It is preferred that the foreign DNA is associated with a position in the plant genome that allows introgression into desired commercial genetic backgrounds. The status of an event as an elite event is confirmed by introgression of the elite event in different relevant genetic backgrounds and observing compliance with one, two or all of the criteria e.g. a), b) and c) above.

[0047] An “elite event” thus refers to a genetic locus comprising a foreign DNA, which answers to the above-described criteria. A plant, plant material or progeny such as seeds can comprise one or more elite events in its genome. Thus, when referring to a plant, seed cell or tissue comprising elite event EE-GH1 in its genome, a plant, seed cell or tissue is intended which comprises the foreign DNA described herein (comprising the 35S-bar gene) integrated in its genome at the integration site described herein.

[0048] The tools developed to identify an elite event or the plant or plant material comprising an elite event, or products which comprise plant material comprising the elite event are based on the specific genomic characteristics of the elite event, such as, a specific restriction map of the genomic region comprising the foreign DNA, molecular markers or the sequence of the flanking region(s) of the foreign DNA.

[0049] Once one or both of the flanking regions of the foreign DNA have been sequenced, primers and probes can be developed which specifically recognize this (these) sequence(s) in the nucleic acid (DNA or RNA) of a sample by way of a molecular biological technique. For instance a PCR method can be developed to identify the elite event in biological samples (such as samples of plants, plant material or products comprising plant material). Such a PCR is based on at least two specific “primers” one recognizing a sequence within the 5′ or 3′ flanking region of the elite event and the other recognizing a sequence within the foreign DNA. The primers preferably have a sequence of between 15 and 35 nucleotides which under optimized PCR conditions “specifically recognize” a sequence within the 5′ or 3′ flanking region of the elite event and the foreign DNA of the elite event respectively, so that a specific fragment (“integration fragment”) is amplified from a nucleic acid sample comprising the elite event. This means that only the targeted integration fragment, and no other sequence (of that size) in the plant genome or foreign DNA, is amplified under optimized PCR conditions. Preferably, the integration fragment has a length of between 50 and 500 nucleotides, most preferably of between 100 and 350 nucleotides. Preferably the specific primers have a sequence which is between 80 and 100% identical to a sequence within the 5′ or 3′ flanking region of the elite event and the foreign DNA of the elite event, respectively, provided the mismatches still allow specific identification of the elite event with these primers under optimized PCR conditions. The range of allowable mismatches however, can easily be determined experimentally and are known to a person skilled in the art. As the sequence of the primers and their recognized sequence in the genome are unique for the elite event, amplification of the integration fragment will occur only in biological samples comprising (the nucleic acid of) the elite event. Preferably when performing a PCR to identify the presence of EE-GH1 in unknown samples, a control is included of a set of primers with which a fragment within a “housekeeping gene” of the plant species of the event can be amplified. Housekeeping genes are genes that are expressed in most cell types and which are concerned with basic metabolic activities common to all cells. Preferably, the fragment amplified from the housekeeping gene is a fragment which is larger than the amplified integration fragment. Depending on the samples to be analyzed, other controls can be included.

[0050] Standard PCR protocols are described in the art, such as in “PCR Applications Manual” (Roche Molecular Biochemicals, 2nd Edition, 1999). The optimal conditions for the PCR, including the sequence of the specific primers, is specified in a “PCR identification protocol” for each elite event. It is however understood that a number of parameters in the PCR identification protocol may need to be adjusted to specific laboratory conditions, and may be modified slightly to obtain similar results. For instance, use of a different method for preparation of DNA may require adjustment of, for instance, the amount of primers, polymerase and annealing conditions used. Similarly, the selection of other primers may dictate other optimal conditions for the PCR identification protocol. These adjustments will however be apparent to a person skilled in the art, and are furthermore detailed in current PCR application manuals such as the one cited above.

[0051] Alternatively, specific primers can be used to amplify an integration fragment that can be used as a “specific probe” for identifying EE-GH1 in biological samples. Contacting nucleic acid of a biological sample, with the probe, under conditions which allow hybridization of the probe with its corresponding fragment in the nucleic acid, results in the formation of a nucleic acid/probe hybrid. The formation of this hybrid can be detected (e.g. labeling of the nucleic acid or probe), whereby the formation of this hybrid indicates the presence of EE-GH1. Such identification methods based on hybridization with a specific probe (either on a solid phase carrier or in solution) have been described in the art. The specific probe is preferably a sequence which, under optimized conditions, hybridizes specifically to a region within the 5′ or 3′ flanking region of the elite event possibly also comprising part of the foreign DNA contiguous therewith (hereinafter also referred to as a “specific region” of the event). Preferably, the specific probe comprises a sequence of between 50 and 500 bp, preferably of 100 to 350 bp which hybridizes under stringent conditions to the nucleotide sequence (or the complement of such sequence) of a specific region. Preferably, the specific probe will comprise a sequence of about 15 to about 100 contiguous nucleotides identical (or complementary) to a specific region of the elite event.

[0052] A “restriction map” as used herein refers to a set of Southern blot patterns obtained after cleaving plant genomic DNA (and/or the foreign DNA comprised therein) with a particular restriction enzyme, or set of restriction enzymes and hybridization with a probe sharing sequence similarity with the foreign DNA under standard stringency conditions. Standard stringency conditions as used herein refers to the conditions for hybridization described herein or to the conventional hybridizing conditions as described by Sambrook et al. (1989) (Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbour Laboratory Press, NY) which for instance can comprise the following steps: 1) immobilizing plant genomic DNA fragments on a filter, 2) prehybridizing the filter for 1 to 2 hours at 42° C. in 50% formamide, 5×SSPE, 2× Denhardt's reagent and 0.1% SDS, or for 1 to 2 hours at 68° C. in 6×SSC, 2× Denhardt's reagent and 0.1% SDS, 3) adding the hybridization probe which has been labeled, 4) incubating for 16 to 24 hours, 5) washing the filter for 20 min. at room temperature in 1×SSC, 0.1% SDS, 6) washing the filter three times for 20 min. each at 68° C. in 0.2×SSC, 0.1%SDS, and 7) exposing the filter for 24 to 48 hours to X-ray film at -70° C. with an intensifying screen.

[0053] Due to the (endogenous) restriction sites present in a plant genome prior to incorporation of the foreign DNA, insertion of a foreign DNA will alter the specific restriction map of that genome. Thus, a particular transformant or progeny derived thereof can be identified by one or more specific restriction patterns.

[0054] Alternatively, plants or plant material comprising an elite event can be identified by testing according to a PCR identification protocol. This is a PCR which specifically recognizes the elite event. Essentially, a set of PCR primers is developed which recognizes a) a sequence within the 3′ or 5′ flanking sequence of the elite event and b) a sequence within the foreign DNA, which primers amplify a fragment (integration fragment) preferably of between 100 and 300 nucleotides. Preferably, a control is included of a set of primers which amplifies a fragment within a housekeeping gene of the plant species (preferably a fragment which is larger than the amplified integration fragment). The optimal conditions for the PCR, including the sequence of the specific primers is specified in a PCR identification protocol.

[0055] Other methods for identifying plants, plant material or products comprising plant material comprising elite event EE-GH1 are also envisaged. These methods include all methods based on the detection of the foreign DNA sequence and flanking sequence(s) of the elite event with a specific probe. More particularly, chip-based technologies, such as those described by Hacia et al. 1996 (Nat Genet 14(4):441-447) and Shoemaker et al. 1996 (Nat Genet 14(4):450-456) are envisaged. These methods allow segregation of target molecules as high-density arrays by using fixed probe arrays or by tagging of the genes with oligonucleotides, after which they can be screened by hybridization.

[0056] Identification of the protein(s) encoded by the foreign DNA of the elite event can be done by classical protein detection methods described in the art, such as those based on chromatographic or electromagnetic properties of the protein or the detection by specific monoclonal antibodies (as described in “Guide to protein purification, Murray P. Deutscher editor).

[0057] A “kit” as used herein refers to a set of reagents for the purpose of performing the method of the invention, more particularly, the identification of the elite event EE-GH1 in biological samples. More particularly, a preferred embodiment of the kit of the invention comprises at least one or two specific primers, as described above. Optionally, the kit can further comprise any other reagent described herein in the PCR identification protocol. Alternatively, according to another embodiment of this invention, the kit can comprise a specific probe, as described above, which specifically hybridizes with nucleic acid of biological samples to identify the presence of EE-GH1 therein. Optionally, the kit can further comprise any other reagent (such as but not limited to hybridizing buffer, label) for identification of EE-GH1 in biological samples, using the specific probe.

[0058] The kit of the invention can be used, and its components can be specifically adjusted, for purposes of quality control (e.g., purity of seed lots), detection of the elite event in plant material or material comprising or derived from plant material, such as but not limited to food or feed products.

[0059] The present invention relates to the development of an elite event in cotton, EE-GH1, to the plants comprising this event, the progeny obtained from these plants and to the plant cells, or plant material derived from this event. Plants comprising elite event EE-GH1 were obtained through transformation with pGSV71 as described in example 1.

[0060] Cotton plants or plant material comprising EE-GH1 can be identified according to the PCR identification protocol described for EE-GH1 in Example 4 herein. Briefly, cotton genomic DNA is amplified by PCR using a primer which specifically recognizes a sequence within the 5′ or 3′ flanking sequence of EE-GH1, particularly the primer with the sequence of SEQ ID NO: 2, and a primer which recognizes a sequence in the foreign DNA, particularly the primer with the sequence of SEQ ID NO: 3. Endogenous cotton DNA primers are used as controls. If the plant material yields a fragment of between 250 and 290 bp, preferably of about 269 bp, the cotton plant is determined to harbor elite event EE-GH1.

[0061] Plants harboring EE-GH1 are characterized by their glufosinate tolerance, which in the context of the present invention includes that plants are tolerant to the herbicide Liberty™. Tolerance to Liberty™ can be tested in different ways. The leaf paint method as described herein, is most useful when you wish to identify both resistant and sensitive plants, but do not want to kill the sensitive ones. Alternatively, tolerance can be tested by Liberty™ spray application. Spray treatments should be made between the leaf stages V3 and V4 for best results. Tolerant plants are characterized by the fact that spraying of the plants with at least 200 grams active ingredient/hectare (g.a.i./ha), preferably 400 g.a.i./ha, and possibly up to 1600 g.a.i./ha (4× the normal field rate), does not kill the plants. A broadcast application should be applied at a rate of 28-34 oz Liberty™. It is best to apply at a volume of 20 gallons of water per acre using a flat fan type nozzle while being careful not to direct spray applications directly into the whorl of the plants to avoid surfactant burn on the leaves. The herbicide effect should appear within 48 hours and be clearly visible within 5-7 days.

[0062] Plants harboring EE-GH1 can further be characterized by the presence in their cells of phosphinothricin acetyl transferase as determined by a PAT assay (De Block et al, 1987, supra).

[0063] Plants harboring EE-GH1 can, for example, be obtained from seeds deposited at the ATCC under accession number PTA-3343, which contain 50% kernels that are hemizigous for the elite event. Such plants can be further propagated to introduce the elite event of the invention into other cultivars of the same plant species. Selected seeds obtained from these plants contain the elite event stably incorporated into their genome. The invention further relates to plants derived from the ATCC accession number PTA-334, comprising EE-GH1. The term “derived from” herein indicates that the plants are related, i.e. they are both progeny (direct or of two or more generations) of the same transformant by crossing.

[0064] Plants harboring EE-GH1 are also characterized by having agronomical characteristics that are comparable to commercially available varieties of cotton in the US, in the absence of weed pressure and use of Liberty™ for weed control. It has been observed that the presence of a foreign DNA in the insertion region of the cotton plant genome described herein, confers particularly interesting phenotypic and molecular characteristics to the plants comprising this event. More specifically, the presence of the foreign DNA in this particular region in the genome of these plants, results in plants which display a stable phenotypic expression of the gene of interest without significantly compromising any aspect of desired agronomic performance of the plants. Thus, the insertion region, corresponding to a sequence comprising the plant DNA of SEQ ID NO: 1 and/or SEQ ID NO: 4, more particularly a sequence corresponding to SEQ ID NO: 5, most particularly the insertion site of EE-GH1 therein, is shown to be particularly suited for the introduction of a gene(s) of interest. More particularly, the insertion region of EE-GH1 (corresponding to a DNA sequence of at least 40 bp in the cotton genome within SEQ ID NO: 5), or a sequence of at least 40 bp which hybridizes under stringent conditions to the complement of the sequence of SEQ ID NO: 5, is particularly suited for the introduction of foreign DNA comprising a herbicide tolerance gene, ensuring expression of each of these genes in the plant without compromising agronomic performance.

[0065] A recombinant DNA molecule can be specifically inserted in an insertion region by targeted insertion methods. Such methods are well known to those skilled in the art and comprise, for example, homologous recombination using a recombinase such as, but not limited to the FLP recombinase from Saccharomyces cerevisiae (published PCTP application WO 99/25821), the CRE recombinase from Escherichia coli phage P1 (published PCT application WO 99/25840), the recombinase from pSR1 of Saccharomyces rouxii (Araki et al. 1985, J Mol Biol 182:191-203), the Gin/gix system of phage Mu (Maeser and Kahlmann, 1991, Mol Gen Genetics 230:170-176) or the lambda phage recombination system (such as described in U.S. Pat. No. 4,673,640).

[0066] As used herein, “sequence identity” with regard to nucleotide sequences (DNA or RNA), refers to the number of positions with identical nucleotides divided by the number of nucleotides in the shorter of the two sequences. The alignment of the two nucleotide sequences is performed by the Wilbur and Lipmann algorithm (Wilbur and Lipmann, 1983, Proc Nat Acad Sci USA 80:726) using a window-size of 20 nucleotides, a word length of 4 nucleotides, and a gap penalty of 4. Computer-assisted analysis and interpretation of sequence data, including sequence alignment as described above, can, e.g., be conveniently performed using the programs of the Wisconsin Package (from the Genetics Computer Group, Inc). Sequences are indicated as “essentially similar” when such sequences have a sequence identity of at least about 75%, particularly at least about 80%, more particularly at least about 85%, quite particularly about 90%, especially about 95%, more especially about 100%, quite especially are identical. It is clear that when RNA sequences are said to be essentially similar or have a certain degree of sequence identity with DNA sequences, thymine (T) in the DNA sequence is considered equal to uracil (U) in the RNA sequence. “Complementary to” as used herein refers to the complementarity between the A and T (U), and G and C nucleotides in nucleotide sequences.

[0067] As used in herein, a “biological sample” is a sample of a plant, plant material or products comprising plant material. The term “plant” is intended to encompass cotton (such as but not limited to Gossypium hirsutum) plant tissues, at any stage of maturity, as well as any cells, tissues, or organs taken from or derived from any such plant, including without limitation, any seeds, leaves, stems, flowers, roots, single cells, gametes, cell cultures, tissue cultures or protoplasts. “Plant material”, as used herein refers to material which is obtained or derived from a plant. Products comprising plant material relate to food, feed or other products which are produced using plant material or can be contaminated by plant material. It is understood that, in the context of the present invention, such biological samples are preferably tested for the presence of nucleic acids specific for EE-GH1, implying the presence of nucleic acids in the samples. Thus the methods referred to herein for identifying elite event EE-GH1 in biological samples, preferably relate to the identification in biological samples of nucleic acids which comprise the elite event.

[0068] As used herein “comprising” is to be interpreted as specifying the presence of the stated features, integers, steps or components as referred to, but does not preclude the presence or addition of one or more features, integers, steps or components, or groups thereof. Thus, e.g., a nucleic acid or protein comprising a sequence of nucleotides or amino acids, may comprise more nucleotides or amino acids than the actually cited ones, i.e., be embedded in a larger nucleic acid or protein. A chimeric gene comprising a DNA sequence which is functionally or structurally defined, may comprise additional DNA sequences, etc.

[0069] The following examples describe the development and characteristics of cotton plants harboring the elite events EE-GH1 as well as the development of tools for the identification of elite event EE-GH1 in biological samples.

[0070] Unless otherwise stated, all recombinant DNA techniques are carried out according to standard protocols as described in Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbour Laboratory Press, NY and in Volumes 1 and 2 of Ausubel et al. (1994) Current Protocols in Molecular Biology, Current Protocols, USA. Standard materials and methods for plant molecular work are described in Plant Molecular Biology Labfax (1993) by R. D. D. Croy published by BIOS Scientific Publications Ltd (UK) and Blackwell Scientific Publications, UK.

[0071] In the description and examples, reference is made to the following sequences: SEQ ID NO: 1: sequence comprising the 5′ flanking region SEQ ID NO: 2: primer GHI06 SEQ ID NO: 3: primer GHI05 SEQ ID NO: 4: sequence comprising the 3′ flanking region SEQ ID NO: 5: insertion region SEQ ID NO: 6: plasmid pGSV71 SEQ ID NO: 7: plasmid pRVA44 SEQ ID NO: 8: primer MDB327 SEQ ID NO: 9: primer MLD015 SEQ ID NO: 10: primer MLD016 SEQ ID NO: 11: primer MDB612 SEQ ID NO: 12: primer MDB053 SEQ ID NO: 13: primer MDB356 SEQ ID NO: 14: primer DPA017 SEQ ID NO: 15: primer MLDO19 SEQ ID NO: 16: sequence comprising target site deletion SEQ ID NO: 17: primer GHI01 SEQ ID NO: 18: primer GHI02

EXAMPLES Example 1 Transformation of Cotton with a Herbicide Tolerance Gene

[0072] 1.1. Construction of the Chimeric DNA Comprising the Bar Gene Under the Control of a Constitutive Promoter

[0073] A plasmid pGSV71 was constructed following standard procedures. The sequence of the genetic elemennts of plasmid pGSV71 is given in table 1 (SEQ ID NO: 6): TABLE 1 Nucleotide positions of the genetic elements in pGSV71 Nt Abbrevi- positions ation Description and references 198-222 — Right border Repeat from the TL-DNA from pTiB6S3 (Gielen et al., 1984, EMBO J. 3:835-846) 223-249 — Polylinker 250-1634 P35S3 Promoter of the 35S RNA of Cauliflower Mosaic Virus (Odell et al., (1985), Nature 313:810-812) 1635-2186 bar Coding sequence encoding phosphino- thricine-acetyl-transferase from Streptomyces hygroscopicus (Thompson et al., (1987) EMBO J. 6: 2519-2523). The N-terminal two codons of the wild-type bar coding region have been substituted for codons ATG and GAG respectively. 2187-2205 — Polylinker 2206-2465 3′nos A 260 bp Taql fragment from the 3′ untran- slated region of the nopaline-synthase gene originating from the T-DNA of pTiT37 (Depicker et al., 1982, J. Mol. Appl. Genet. 1:561-573) 2466-2519 — Polylinker 2520-2544 — Left border Repeat from the TL-DNA from pTiB6S3 (Gielen et al., 1984, EMBO J. 3:835-846)

[0074] 1.2. Transformation of Gossypium hirsutum

[0075] Cotton tissue from Coker3l 2 plants was transformed with pGSV71 using Agrobacterium transformation (U.S. Pat. No. 5,986,181) and regenerated to plants on appropriate media.

[0076] The small plantlets initiated on the selective regeneration media were transferred to new medium for germination (all medium is hormone-free). Plantlets were then transferred to the growth chambers or to the greenhouses.

[0077] Selection was done on phosphinothricin (PPT) at all stages except plantlet regeneration, which was done in the absence of PPT to accelerate growth. This resulted in a set of primary transformants (plants of generation TO).

Example 2 Development of Events

[0078] 2.1. Development of Lines Carrying the Event Trait

[0079] T0 shoots were transferred to greenhouse soil and plants were screened for glufosinate tolerance and for the presence of the PAT enzyme with a PAT ELISA (Steffens Biotechnische Analysen GmbH, Ebringen, Germany).

[0080] T1 to T3 plants were grown in the greenhouse and tested for Liberty™ tolerance at a 2× rate (by spraying 56 oz/ha). Positive plants were tested for expression of the bar gene using the Pat assay as described by Deblock et al. 1987 (EMBO J. 6:2513-2518).

[0081] Presence of the foreign DNA and copy number was checked by Southern blot analysis. Total genomic DNA was isolated from 1 g of leaf tissue according to the CETAB method of Doyle et al. (1987, Phytochem. Bull. 19:11) and digested with EcoRI restriction enzyme.

[0082] Probes such as the following were used for Southern analysis:

[0083] “bar” probe: 474 bp KpnI-BgII digest of plasmid pDE110 (WO 92/09696)

[0084] “35S” probe: 892 bp NcoI-MunI digest of plasmid pRVA44 (SEQ ID NO: 7)

[0085] T2 Plants were also evaluated for general phenotypic characteristics compared to the non-transgenic isogenic lines. In later generations, the lines for which no negative penalties on phenotype or agronomic performance was observed for the presence of the transgene either in hemizygous or in homozygous condition, as compared to wild-types were selected.

[0086] T4 material was grown in the field and tested under field conditions for Liberty™ tolerance according to different schedules.

[0087] In later generations, plants were compared to commercial varieties for yield, fiber quality and plant mapping data. Agronomic characteristics, such as plant height, height to node, boll retention, stand, vigor, fiber length, fiber strength and lint yield were evaluated.

[0088] It was determined that one event performed equally or better than the comparable checks and that for this event yield was dependent on background rather than on presence of the transgene.

[0089] 2.2. Selection of an Elite Event

[0090] This selection procedure, yielded one elite event which displayed optimal expression of the 35S-bar gene, i.e. tolerance to glufosinate ammonium, without penalty on agronomic performance and yield. This elite event was named EE-GH1.

[0091] 2.3. Testing of EE-GH1 in Cotton Varieties with Different Genetic Backgrounds and in Different Locations

[0092] The selected event was introduced into different commercial genetic backgrounds, including FM989, FM 832, FM958, and FM966 and results of field trials of four different locations were compared. Plants were sprayed with 1600 g.a.i./ha, using different treatments (1×3-5 leaf stage, 4×, 3-5 leaf stage, 1×+1×, 3-5 leaf stage, 4×+4×, 3-5 leaf stage, 0 as control).

[0093] Seedling emergence and vigor rating for the elite event was very good.

[0094] No visible damage as a result of herbicide application was ever observed after application regardless of rate or stage of development at the time of application.

[0095] There were no detrimental effects on morphology or growth habit of plants by herbicide application

[0096] Furthermore, the event had normal leaf, flower and boll morphology, excellent fertility, and showed no disease or abnormal insect susceptibility in multiple genetic backgrounds. During introgression into multiple genetic backgrounds no aberrant problems or abnormalities were observed over four generations.

[0097] 2.4. Genetic Analysis of the Locus

[0098] The genetic stability of the insert for the EE-GH1 event was checked by molecular and phenotypic analysis in the progeny plants over several generations.

[0099] Southern blot analyses of plants of the T1, T2 and T3 generation were compared for the EE-GH1 event. The patterns obtained were found to be identical in the different generations. This proves that the molecular configuration of the foreign DNA in EE-GH1 was stable.

[0100] The EE-GH1 event displayed Mendelian segregation for the transgene as a single genetic locus in at least three subsequent generations indicating that the insert is stable.

Example 3 Characterization of Elite Event EE-GH1

[0101]3.1 In-Depth Molecular and Genetic Analysis of the Locus

[0102] Once the EE-GH1 event was identified as the event in which expression of the transgene as well as overall agronomic performance were optimal, the locus of the transgene was analyzed in detail on a molecular level. This included sequencing of the flanking regions of the transgene.

[0103] The sequence of the regions flanking the inserted transgene in the EE-GH1 event was determined using the TAIL-PCR protocol as described by Liu et al. (1995, Plant J. 8(3): 457-463).

[0104] a) Determination of the 5′ Flanking Region

[0105] The primers used were: Position in Sequence (5′→3′) pGSV71 Degenerate MDB327 NTg.Agg.WTC.NWg.TSA.T (SEQ ID NO:8) — primer Primary TAIL MLD015 Tgg.TTC.CTA.gCg.TgA.gCC.AgT.g (SEQ ID NO:9) 606→585 Second. TAIL MLD016 AgC.TgC.TgC.TCT.TgC.CTC.TgT (SEQ ID NO:10) 467→447 Tertiary TAIL GHI05 ggA.CCg.TTA.TAC.ACA.ACg.Tag (SEQ ID NO:3) 358→338

[0106] The fragment amplified using MDB327-GHI05 was ca. 1200 bp which was sequenced (5′ flank: SEQ ID NO: 1). The sequence between bp 1 and bp 677 comprised plant DNA, while the sequence between bp 678 and bp 850 corresponded to pGSV71 DNA.

[0107] b) Determination of the 3′ Flanking Region

[0108] The primers used were: Position in Sequence (5′→3′) pGSV71 Degenerate MDB612 NgT.gCT.SWg.ANA.WgA.T (SEQ ID NO:11) — primer Primary MDB053 CAT.gAC.gTg.ggT.TCC.Tgg.Cag.C (SEQ ID NO:12) 2109-2130 TAIL Secondary MDB356 AAT.CCT.gTT.gCC.ggT.CTT.gCg (SEQ ID NO:13) 2252-2272 TAIL Tertiary TAIL DPA017 gAT.TAg.AgT.CCC.gCA.ATT.ATA.C (SEQ ID NO:14) 2362-2383

[0109] The fragment amplified using MDB612-DPA017 was ca. 400 bp, the complete sequence of which was determined (SEQ ID NO: 4). The sequence between nucleotide 1 and 179 corresponds to T-DNA, while the sequence between nucleotide 180 and 426 corresponds to plant DNA.

[0110] c) Identification of the Target Site Deletion

[0111] Using primers corresponding to sequences within the flanking regions of the transgene on the wildtype Gossypium hirsutum as a template, the insertion site of the transgene was identified.

[0112] The following primers were used: Position in Position in 5′flank 3′flank Sequence (5′→3′) (SEQ ID NO:1) (SEQ ID NO:4) GHI06 TTg.CAC.CAT.CTA.gCT.CAC.TC (SEQ ID NO:2) 815→795 - - - - - MLD019 CAA.gAT.gCg.AgC.AAC.TAT.gT (SE ID NO:15) - - - - - 285→266

[0113] This yielded a 200 bp fragment (SEQ ID NO: 16) in which bp 85 to 122 corresponds to a target site deletion.

[0114] Thus, the insertion region (SEQ ID NO: 5) as sequenced comprises: 1-677: 5′ flanking region bp 1 to 677 of SEQ ID NO: 1 678-714: target site deletion bp 85 to 122 of SEQ ID NO: 16 715-916: 3′ flanking region bp 180 to 426 of SEQ ID NO: 4

[0115] 3.2. Genetic Analysis of the Locus

[0116] The genetic stability of the insert was checked by molecular and phenotypic analysis in the progeny plants over several generations. Southern blot analyses on glufosinate tolerant plants of EE-GH1 cotton plants of the T₀, T₁ and T₂ generation were compared and were found to be identical. This proves that the molecular configuration of the transgene in EE-GH1 containing plants was stable.

[0117] The EE-GH1 event displayed Mendelian segregation for the transgene as a single genetic locus in at least three subsequent generations indicating that the insert is stable.

[0118] On the basis of the above results EE-GH1 was identified as an elite event.

Example 4 Development of Diagnostic Tools for Identity Control

[0119] A EE-GH1 Elite event PCR Identification protocol was developed to identify the presence of EE-GH1 in plants, plant material or biological samples.

[0120] EE-GH1 Elite event Polymerase Chain Reaction Identification Protocol

[0121] A test run, with all appropriate controls, has to be performed before attempting to screen unknowns. The presented protocol might require optimization for components that may differ between labs (template DNA preparation, Taq DNA polymerase, quality of the primers, dNTP's, thermocyler, etc.).

[0122] Amplification of the endogenous sequence plays a key role in the protocol. One has to attain PCR and thermocycling conditions that amplify equimolar quantities of both the endogenous and transgenic sequence in a known transgenic genomic DNA template. Whenever the targeted endogenous fragment is not amplified or whenever the targeted sequences are not amplified with the same ethidium bromide staining intensities, as judged by agarose gel electrophoresis, optimization of the PCR conditions may be required.

[0123] Template DNA

[0124] Template DNA is prepared according to the CTAB method described by Doyle and Doyle (1987, Phytochem. Bull. 19: 11). When using DNA prepared with other methods, a test run utilizing different amounts of template should be done. Usually 50 ng of genomic template DNA yields the best results.

[0125] Assigned Positive and Negative Controls

[0126] The following positive and negative controls should be included in a PCR run:

[0127] Master Mix control (DNA negative control). This is a PCR in which no DNA is added to the reaction. When the expected result, no PCR products, is observed this indicates that the PCR cocktail was not contaminated with target DNA.

[0128] A DNA positive control (genomic DNA sample known to contain the transgenic sequences). Successful amplification of this positive control demonstrates that the PCR was run under conditions which allow for the amplification of target sequences.

[0129] A wildtype DNA control. This is a PCR in which the template DNA provided is genomic DNA prepared from a non-transgenic plant. When the expected result, no amplification of the transgene PCR product but amplification of the endogenous PCR product, is observed this indicates that there is no detectable transgene background amplification in a genomic DNA sample.

[0130] Primers

[0131] The following primers, which specifically recognize the transgene and a flanking sequence of EE-GH1 are used: Position in Primer Sequence (5′→3′) SEQ ID NO: 1 Target GHI05 ggA.CCg.TTA.TAC.ACA.ACg.Tag (SEQ ID NO:3) 758→738 pGSV71 sequence GHI06 TTg.CAC.CAT.CTA.gCT.CAC.TC (SEQ ID NO:2) 815→795 Plant DNA Sequence

[0132] Primers targeting an endogenous sequence are always included in the PCR cocktail. These primers serve as an internal control in unknown samples and in the DNA positive control. A positive result with the endogenous primer-pair demonstrates that there is ample DNA of adequate quality in the genomic DNA preparation for a PCR product to be generated. The endogenous primers used are: GHI01: 5′-AAC.CTA.ggC.TgC.TgA.Agg. (SEQ ID NO:17) AgC-3′ (Alcohol dehydrogenase gene Acc. NO: AF036569, 107→1090) GHI02: 5′-CAA.CTC.CTC.CAg.TCA.TCT. (SEQ ID NO: 18) CCg-3′ (Alcohol dehydrogenase gene Acc. NO: AF036569, 1515→1495)

[0133] Amplified Fragments

[0134] The expected amplified fragments in the PCR reaction are: For primer pair GHI01-GHI02: 445 bp (endogenous control) For primer pair GHI05-GHI06: 269 bp (EE-GH1 Elite Event)

[0135] PCR Conditions

[0136] The PCR mix for 50 μl reactions contains:

[0137] 5 μl template DNA

[0138] 5 μl 10×Amplification Buffer (supplied with Taq polymerase)

[0139] 1 μl 10 mM dNTP's

[0140] 0.5 μl GHI01 (10 pmoles/μl)

[0141] 0.5 μl GHI02 (10 pmoles/μl)

[0142] 1 μl GHI05 (10 pmoles/μl)

[0143] 1 μl GHI06 (10 pmoles/μl)

[0144] 0.2 μl Taq DNA polymerase (5 units/μl)

[0145] water up to 50 μl

[0146] The thermocycling profile to be followed for optimal results is the following:

[0147] 4 min. at 95° C.

[0148] Followed by:

[0149] 1 min. at 95° C

[0150] 1 min. at 57° C.

[0151] 2 min. at 72° C.

[0152] For 5 cycles

[0153] Followed by:

[0154] 30 sec. at 92° C.

[0155] 30 sec. at 57° C.

[0156] 1 min. at 720C

[0157] For 25 cycles

[0158] Followed by:

[0159] 5 minutes at 720C

[0160] Agarose Gel Analysis

[0161] Between 10 and 20 μl of the PCR samples should be applied on a 1.5% agarose gel (Tris-borate buffer) with an appropriate molecular weight marker (e.g. 100 bp ladder PHARMACIA).

[0162] Validation of the Results

[0163] Data from transgenic plant DNA samples within a single PCR run and a single PCR cocktail should not be acceptable unless 1) the DNA positive control shows the expected PCR products (transgenic and endogenous fragments), 2) the DNA negative control is negative for PCR amplification (no fragments) and 3) the wild-type DNA control shows the expected result (endogenous fragment amplification).

[0164] Lanes showing visible amounts of the transgenic and endogenous PCR products of the expected sizes, indicate that the corresponding plant from which the genomic template DNA was prepared, has inherited the EE-GH1 elite event. Lanes not showing visible amounts of the transgenic PCR product and showing visible amounts of the endogenous PCR product, indicate that the corresponding plant from which the genomic template DNA was prepared, does not comprise the elite event. Lanes not showing visible amounts of the endogenous and transgenic PCR products, indicate that the quality and/or quantity of the genomic DNA didn't allow for a PCR product to be generated. These plants cannot be scored. The genomic DNA preparation should be repeated and a new PCR run, with the appropriate controls, has to be performed.

[0165] Use of Discriminating PCR Protocol to Identify EE-GH1

[0166] Cotton leaf material from plants comprising different transgenic events (samples 1 to 4) was tested according to the above-described protocol. Samples from cotton wild-type were taken as negative controls.

[0167] The results of the PCR analysis are illustrated in FIG. 1. Sample 1 is recognized as comprising elite event EE-GH1. All other tested lines do not comprise this elite event.

Example 5 Introgression of EE-GH1 into Preferred Cultivars

[0168] Elite event EE-GH1 is introduced by repeated back-crossing into the following commercial cotton cultivars: FM5013, FM5015, FM5017, FM989, FM832, FM966 and FM958.

[0169] It is observed that the introgression of the elite event into these cultivars does not significantly influence any of the desirable phenotypic or agronomic characteristics of these cultivars (no linkage drag) while expression of the transgene, as determined by glufosinate tolerance, meets commercially acceptable levels. This confirms the status of event EE-GH1 as an elite event.

[0170] As used in the claims below, unless otherwise clearly indicated, the term “plant” is intended to encompass plant tissues, at any stage of maturity, as well as any cells, tissues, or organs taken from or derived from any such plant, including without limitation, any seeds, leaves, stems, flowers, roots, single cells, gametes, cell cultures, tissue cultures or protoplasts.

[0171] Reference seed comprising elite event EE-GH1 was deposited as EE-GH1 at the ATCC (10801 University Blvd., Manassas, Va. 20110-2209) on Apr. 26, 2001, under ATCC accession number PTA-3343.

[0172] As used in the claims below, unless otherwise clearly indicated, the term “plant” is intended to encompass plant tissues, at any stage of maturity, as well as any cells, tissues, or organs taken from or derived from any such plant, including without limitation, any seeds, leaves, stems, flowers, roots, single cells, gametes, cell cultures, tissue cultures or protoplasts.

[0173] The above description of the invention is intended to be illustrative and not limiting. Various changes or modifications in the embodiments described may occur to those skilled in the art. These can be made without departing from the spirit or scope of the invention.

0 SEQUENCE LISTING <160> NUMBER OF SEQ ID NOS: 85 <210> SEQ ID NO 1 <211> LENGTH: 2520 <212> TYPE: DNA <213> ORGANISM: Arabidopsis thaliana <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (154)..(2376) <400> SEQUENCE: 1 aaaaatcgtc aatccctctc aaactcttct caccactaat ttcttcctct ggaacattct 60 cttctctatt attttgattc ccttggcctc aacactggtt tctcaattgc atgatcttgg 120 ctcgtcttca gttactttga ttcactgaga aaa atg gcg act gga gta ttg cca 174 Met Ala Thr Gly Val Leu Pro 1 5 gct ccg gtt tct ggg atc aag ata ccg gat tcg aaa gtc ggg ttt ggt 222 Ala Pro Val Ser Gly Ile Lys Ile Pro Asp Ser Lys Val Gly Phe Gly 10 15 20 aaa agc atg aat ctt gtg aga att tgt gat gtt agg agt cta aga tct 270 Lys Ser Met Asn Leu Val Arg Ile Cys Asp Val Arg Ser Leu Arg Ser 25 30 35 gct agg aga aga gtt tcg gtt atc cgg aat tca aac caa ggc tct gat 318 Ala Arg Arg Arg Val Ser Val Ile Arg Asn Ser Asn Gln Gly Ser Asp 40 45 50 55 tta gct gag ctt caa cct gca tcc gaa gga agc cct ctc tta gtg cca 366 Leu Ala Glu Leu Gln Pro Ala Ser Glu Gly Ser Pro Leu Leu Val Pro 60 65 70 aga cag aaa tat tgt gaa tca ttg cat aag acg gtg aga agg aag act 414 Arg Gln Lys Tyr Cys Glu Ser Leu His Lys Thr Val Arg Arg Lys Thr 75 80 85 cgt act gtt atg gtt gga aat gtc gcc ctt gga agc gaa cat ccg ata 462 Arg Thr Val Met Val Gly Asn Val Ala Leu Gly Ser Glu His Pro Ile 90 95 100 agg att caa acg atg act act tcg gat aca aaa gat att act gga act 510 Arg Ile Gln Thr Met Thr Thr Ser Asp Thr Lys Asp Ile Thr Gly Thr 105 110 115 gtt gat gag gtt atg aga ata gcg gat aaa gga gct gat att gta agg 558 Val Asp Glu Val Met Arg Ile Ala Asp Lys Gly Ala Asp Ile Val Arg 120 125 130 135 ata act gtt caa ggg aag aaa gag gcg gat gcg tgc ttt gaa ata aaa 606 Ile Thr Val Gln Gly Lys Lys Glu Ala Asp Ala Cys Phe Glu Ile Lys 140 145 150 gat aaa ctc gtt cag ctt aat tac aat ata ccg ctg gtt gca gat att 654 Asp Lys Leu Val Gln Leu Asn Tyr Asn Ile Pro Leu Val Ala Asp Ile 155 160 165 cat ttt gcc cct act gta gcc tta cga gtc gct gaa tgc ttt gac aag 702 His Phe Ala Pro Thr Val Ala Leu Arg Val Ala Glu Cys Phe Asp Lys 170 175 180 atc cgt gtc aac cca gga aat ttt gcg gac agg cgg gcc cag ttt gag 750 Ile Arg Val Asn Pro Gly Asn Phe Ala Asp Arg Arg Ala Gln Phe Glu 185 190 195 acg ata gat tat aca gaa gat gaa tat cag aaa gaa ctc cag cat atc 798 Thr Ile Asp Tyr Thr Glu Asp Glu Tyr Gln Lys Glu Leu Gln His Ile 200 205 210 215 gag cag gtc ttc act cct ttg gtt gag aaa tgc aaa aag tac ggg aga 846 Glu Gln Val Phe Thr Pro Leu Val Glu Lys Cys Lys Lys Tyr Gly Arg 220 225 230 gca atg cgt att ggg aca aat cat gga agt ctt tct gac cgt atc atg 894 Ala Met Arg Ile Gly Thr Asn His Gly Ser Leu Ser Asp Arg Ile Met 235 240 245 agc tat tac ggg gat tct ccc cga gga atg gtt gaa tct gcg ttt gag 942 Ser Tyr Tyr Gly Asp Ser Pro Arg Gly Met Val Glu Ser Ala Phe Glu 250 255 260 ttt gca aga ata tgt cgg aaa tta gac tat cac aac ttt gtt ttc tca 990 Phe Ala Arg Ile Cys Arg Lys Leu Asp Tyr His Asn Phe Val Phe Ser 265 270 275 atg aaa gcg agc aac cca gtg atc atg gtc cag gcg tac cgt tta ctt 1038 Met Lys Ala Ser Asn Pro Val Ile Met Val Gln Ala Tyr Arg Leu Leu 280 285 290 295 gtg gct gag atg tat gtt cat gga tgg gat tat cct ttg cat ttg gga 1086 Val Ala Glu Met Tyr Val His Gly Trp Asp Tyr Pro Leu His Leu Gly 300 305 310 gtt act gag gca gga gaa ggc gaa gat gga cgg atg aaa tct gcg att 1134 Val Thr Glu Ala Gly Glu Gly Glu Asp Gly Arg Met Lys Ser Ala Ile 315 320 325 gga att ggg acg ctt ctt cag gac ggg ctc ggt gac aca ata aga gtt 1182 Gly Ile Gly Thr Leu Leu Gln Asp Gly Leu Gly Asp Thr Ile Arg Val 330 335 340 tca ctg acg gag cca cca gaa gag gag ata gat ccc tgc agg cga ttg 1230 Ser Leu Thr Glu Pro Pro Glu Glu Glu Ile Asp Pro Cys Arg Arg Leu 345 350 355 gct aac ctc ggg aca aaa gct gcc aaa ctt caa caa ggc gca ccg ttt 1278 Ala Asn Leu Gly Thr Lys Ala Ala Lys Leu Gln Gln Gly Ala Pro Phe 360 365 370 375 gaa gaa aag cat agg cat tac ttt gat ttt cag cgt cgg acg ggt gat 1326 Glu Glu Lys His Arg His Tyr Phe Asp Phe Gln Arg Arg Thr Gly Asp 380 385 390 cta cct gta caa aaa gag gga gaa gag gtt gat tac aga aat gtc ctt 1374 Leu Pro Val Gln Lys Glu Gly Glu Glu Val Asp Tyr Arg Asn Val Leu 395 400 405 cac cgt gat ggt tct gtt ctg atg tcg att tct ctg gat caa cta aag 1422 His Arg Asp Gly Ser Val Leu Met Ser Ile Ser Leu Asp Gln Leu Lys 410 415 420 gca cct gaa ctc ctc tac aga tca ctc gct aca aag ctt gtc gtg ggt 1470 Ala Pro Glu Leu Leu Tyr Arg Ser Leu Ala Thr Lys Leu Val Val Gly 425 430 435 atg cca ttc aag gat ctg gca act gtt gat tca atc tta tta aga gag 1518 Met Pro Phe Lys Asp Leu Ala Thr Val Asp Ser Ile Leu Leu Arg Glu 440 445 450 455 cta ccg cct gta gat gat caa gtg gct cgt ttg gct cta aaa cgg ttg 1566 Leu Pro Pro Val Asp Asp Gln Val Ala Arg Leu Ala Leu Lys Arg Leu 460 465 470 att gat gtc agt atg gga gtt ata gca cct tta tca gag caa cta aca 1614 Ile Asp Val Ser Met Gly Val Ile Ala Pro Leu Ser Glu Gln Leu Thr 475 480 485 aag cca ttg ccc aat gcc atg gtt ctt gtc aac ctc aag gaa cta tct 1662 Lys Pro Leu Pro Asn Ala Met Val Leu Val Asn Leu Lys Glu Leu Ser 490 495 500 ggt ggc gct tac aag ctt ctc cct gaa ggt aca cgc ttg gtt gtc tct 1710 Gly Gly Ala Tyr Lys Leu Leu Pro Glu Gly Thr Arg Leu Val Val Ser 505 510 515 cta cga ggc gat gag cct tac gag gag ctt gaa ata ctc aaa aac att 1758 Leu Arg Gly Asp Glu Pro Tyr Glu Glu Leu Glu Ile Leu Lys Asn Ile 520 525 530 535 gat gct act atg att ctc cat gat gta cct ttc act gaa gac aaa gtt 1806 Asp Ala Thr Met Ile Leu His Asp Val Pro Phe Thr Glu Asp Lys Val 540 545 550 agc aga gta cat gca gct cgg agg cta ttc gag ttc tta tcc gag aat 1854 Ser Arg Val His Ala Ala Arg Arg Leu Phe Glu Phe Leu Ser Glu Asn 555 560 565 tca gtt aac ttt cct gtt att cat cac ata aac ttc cca acc gga atc 1902 Ser Val Asn Phe Pro Val Ile His His Ile Asn Phe Pro Thr Gly Ile 570 575 580 cac aga gac gaa ttg gtg att cat gca ggg aca tat gct gga ggc ctt 1950 His Arg Asp Glu Leu Val Ile His Ala Gly Thr Tyr Ala Gly Gly Leu 585 590 595 ctt gtg gat gga cta ggt gat ggc gta atg ctc gaa gca cct gac caa 1998 Leu Val Asp Gly Leu Gly Asp Gly Val Met Leu Glu Ala Pro Asp Gln 600 605 610 615 gat ttt gat ttt ctt agg aat act tcc ttc aac tta tta caa gga tgc 2046 Asp Phe Asp Phe Leu Arg Asn Thr Ser Phe Asn Leu Leu Gln Gly Cys 620 625 630 aga atg cgt aac act aag acg gaa tat gta tcg tgc ccg tct tgt gga 2094 Arg Met Arg Asn Thr Lys Thr Glu Tyr Val Ser Cys Pro Ser Cys Gly 635 640 645 aga acg ctt ttc gac ttg caa gaa atc agc gcc gag atc cga gaa aag 2142 Arg Thr Leu Phe Asp Leu Gln Glu Ile Ser Ala Glu Ile Arg Glu Lys 650 655 660 act tcc cat tta cct ggc gtt tcg atc gca atc atg gga tgc att gtg 2190 Thr Ser His Leu Pro Gly Val Ser Ile Ala Ile Met Gly Cys Ile Val 665 670 675 aat gga cca gga gaa atg gca gat gct gat ttc gga tat gta ggt ggt 2238 Asn Gly Pro Gly Glu Met Ala Asp Ala Asp Phe Gly Tyr Val Gly Gly 680 685 690 695 tct ccc gga aaa atc gac ctt tat gtc gga aag acg gtg gtg aag cgt 2286 Ser Pro Gly Lys Ile Asp Leu Tyr Val Gly Lys Thr Val Val Lys Arg 700 705 710 ggg ata gct atg acg gag gca aca gat gct ctg atc ggt ctg atc aaa 2334 Gly Ile Ala Met Thr Glu Ala Thr Asp Ala Leu Ile Gly Leu Ile Lys 715 720 725 gaa cat ggt cgt tgg gtc gac ccg ccc gtg gct gat gag tag 2376 Glu His Gly Arg Trp Val Asp Pro Pro Val Ala Asp Glu 730 735 740 atttcaaaac ggagaaagat gggtgggcca ttctttgaaa actgtgagag aagatatata 2436 tatttgtgtg tgtatatcat ctgtttgttg tgtattgcat catcattttg aacaaatgtc 2496 caaatctctt aagttgataa aagt 2520 <210> SEQ ID NO 2 <211> LENGTH: 33675 <212> TYPE: DNA <213> ORGANISM: Oryza sativa <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (6924)..(7019),(7163)..(7269),(7344)..(7444), (7525)..(7634), <222> LOCATION: (7694)..(7813),(7923)..(8153),(8253)..(8369), (8515)..(8589), <222> LOCATION: (9012)..(9071),(9163)..(9225),(9328)..(9472), (9589)..(9730), <222> LOCATION: (9951)..(10028),(10134)..(10293),(10694)..(10798), <222> LOCATION: (11028)..(11129) <221> NAME/KEY: unsure <222> LOCATION: (1..33675) <223> OTHER INFORMATION: unsure at all n locations <400> SEQUENCE: 2 cttaaccctc gccgactgcc tggagattcg tgccgatcga tacacgtggc agcgcctaac 60 gcgtaacccc tccctcactt ggagattcgt gcaagcaact cgattaatgc attaatgctg 120 tcgcgtaggt ttccctacgg aagagctgag tttcgtaacg aaaaaaaccg gccacgtttc 180 gcatcgagcc tactttaatt agcgtgggaa aataattcaa agtagcgacc tgtaccctgt 240 ggcaacctag cgcgcgcggc catggctctt gttccgctcg tgacagtgct cctgttcgcc 300 ggctcatgcc tcggatcagc gccgccgacg acatcgccgg cggcgtcggc ggcgtccacg 360 gcgacacgta cggtagtagt cgacggcatt acggccatct acaaacctcg gcgactcgct 420 gtcggacacc gcaacctcgc caggcaaggc gccaccggcg ggctgctccg gtacaccacg 480 aggcttccct acggcgtcac cgtcggccgc gccaccggcc ggtgctccga cggctacctc 540 atcatcgact tcctcggtga cgtcatcagt ttaatttctc tctctcttcc gtctgaaaaa 600 tggaagaaac aatattatat tacgttatat atatatgcgt ttttgtttcg gattaaattg 660 tggatatgat cgatcgatgt gcagctagag atcttggcct ccctctgctc aacccgtacc 720 tcgacgaggg cgcggacttc gcccacggcg tcaacttcgc cgtcgccggc gccaccgcgc 780 tcaacacgac ggcgctcgcc gccaggcgga tcaccgtccc ccacaccaac agccccctcg 840 acgtgcagct cagatttttt ttgttttaga gaagggtatt ttttacccgg cctctacatc 900 caaccggata tatacggcta ttgaagtagg gaacttaacc ctgtaaacaa tccatccata 960 gaggatatga acctaagacc ttgaggtact acttcaaccg gatatatacg tgcagctcag 1020 atggttcaag gaattcatga actccacaac tagttctcct caaggtgaac gaacaaactg 1080 aaacgcattt cagcttaatt tcgaccggtg cctgatcagt gccagtcagc aatgctgtat 1140 ctcacaaata attaagctaa tgtacagctt ttcagtgcta gaatgacttt catatagaga 1200 aatcttgtgt tatatatata tacttttttc tgaaagaaaa aagttctttt gtgtgagcat 1260 tgcattgcag agatccgtga aaagctgtcg aagtcactgg ttatgctggg agagatcgga 1320 ggaaacgact acaactacgc cttcctccag acctggccga tggacggtgg atacagcctc 1380 ggcaacgtca cacgcatgat cgaaagcgtt gccacggccg tcgatcttgt accggaagtc 1440 gtgcagtcca tagccagcgc agccaaggta cacaccattc ttttccatta atttttggga 1500 ccttattttt aaaataataa tcctggctac aaagtaatta attaagaact aaattaattt 1560 ttgtgggttt tgtgacacag gaggtgctcg acatgggcgc gacgcgggtg gtgatcccgg 1620 gcaacctccc gctgggttgc gtgccgagct acatgagcgc ggtgaacgcg acggaccggg 1680 cggcgtacga cgcccgcgga tgcctcgtcg cgctcaacct cttcgcggcg ctgcacaacg 1740 cgtggctgcg ccgcgccgtc ggggagctgc ggcgcgcgta ccggggcgcc gcggtggtcg 1800 cgtacgcgga ctactccgcc gcgtacgccg cgacgctgga cggggcagcg gcgctcggct 1860 tcgacgagcg gcgcgtgttc agggcgtgct gcggcaaggg cggcgggggc gcgtacgggt 1920 tcgacgtgcg cgcgatgtgc ggcgcgccgg ggacggcggc gtgcgcggac ccggggaggt 1980 acgtgagctg ggacggcgtc cacctgacgc agcgcgcgta cggcgtcatg gccgagctgc 2040 tgttccgccg tggcctcgtg cacccgcctc cgataaattt cacgaacagc gcgcgcgcgt 2100 gaggcggtgt tgcatggctt gcgcgttttt tctgatcaaa actactcaag tttgagccgt 2160 tttgatttat aaataaaacc atatgcgatt ttgctaaacg tttgtcgcgt gatttctctt 2220 cggaagaaaa aatctcaccc gagtgatgca taggcggtcc caaccatatg tgccctgacc 2280 tttctctgct tccttcgcgt cgtgcactga caacctcaca gtatgttttt ggtatgggcg 2340 cttgcggccc aactcaatct gtaatacatt gggctgtcgt attgggtttg ttggacttca 2400 tagactggat cggagaaagt tgggtaattg actttttcat ttttgctata aaatgattaa 2460 ttaaacagtc taggataatt actgtagact ctaataatat tgtttggtta agtattatta 2520 tacattcctg tatttgacac tctaagagca tggccaagag ttgcctgaaa gtctcttcct 2580 aaatctgcct ttcattctct aatgagaatt taaggattaa aaatatactt attttcaata 2640 gacagcataa atttaattcc ctagaataaa aaaatgcccc cctaacaaca gaaattagat 2700 tcctctaccc gcacctcatc agatcgctcg atttaagatc acgccatctg acaccgccct 2760 cccgctcgct cttctctagt gtgggagtct cgcgctcaag agacggaaat cgggaacaag 2820 aatgattcct agcttagcga gaatgaaggg gaagacatat gtcataccta cacccacata 2880 agtatgccct agcacaaggg atgaaaacgg atcgaaaacg gatggaaact agctttatca 2940 tattcgtttt catttttttt tcggaatcgg attcgaaatc gaaaactcgg atacggaaat 3000 aaaattgaat attatcgaat acagatacgg agcgaatata agatggaacg aatacagtag 3060 cgaatattta ccggtatata aaaaacccct caaattgagt ttcttgatta agaaagagat 3120 atcgcttatt attttagtta aatatctcca acatttatat cgtcaatttt atagacggtt 3180 ccacaatcgt atgtgaaaat cgattttcat ggttgttcct ctaagagatc catatgcaaa 3240 tatgattatc attttctatt ccaagacctt ttactagatg tataacttat ttaccattgc 3300 ataaattgga gatgttattt attttacttc acatcttcga aacttgtaat gtatgtatta 3360 tactttaaat gctttcaagt acaaatgtta taaactacaa agtggtagat cccgttgagc 3420 tctacaactt tgatatggaa cacatctcca tcagatgtcg tttgaattgt agatctgaga 3480 ttttgtaaaa tttaatatgg tatattataa tgaatattta gacccttaaa tgaccttaaa 3540 taataaaata gtcaataata aagttgtaga tctcatcgag ctctataatg ttgatatgaa 3600 gtttgtcttc atctgattcc gtatgaaaaa gttatgtata tatacatgtt tttttataaa 3660 atttgctcaa tatctgcgga tatccgaaaa aaatttcgga tagtttttaa ccgtttttcg 3720 attccgatgg atagtatcct tactgtattc gttttcgttt ccgagaaaaa atatccaaat 3780 tcgtttccga atccgagaat ttttggataa ttccgacaga aactatccga atccgaaaaa 3840 tggttcggac ggacggaaac tatccaaacc agtttcatcc cggctagcac gcatttaaat 3900 tcacatgagg ttgcacattt atctgaggta aaaagattgg aaacggttac tggttcgtca 3960 agaattttcc gtatttatca gtataactat tcaatgacga catcaacata acagaaaatt 4020 aaaacaacat gagtcgattt tatatataac tagaaacgaa aacagtataa ctgttacgaa 4080 aacactagat tgatgggtcg aaaatttcca ccacggtttt tatgcctacc tttcaacgct 4140 cccaaagttc ccacgaccca aaacatgtgt gggagaactt ccgcccacat ggagacggtt 4200 gtctcaggga aacgtgccat ctgctttgct ccaggtcaac acatgtggtg tgactgaact 4260 ggccatcgtc tcaatattgt catctacccg tcataccatg ccaccggacc agaaggtgat 4320 tatggtcttc ggcggccgtc gcgcgcggat gccttgctcc acaacaagtc agccgctcaa 4380 accacactcc cctttggcat tgaacatgag gtttgacgac gatgtgtgtg tatgtttggg 4440 caggtagctt tgtttcaagc tgcactagct aattaagatc gatctccttg tcaaagtcac 4500 gatcaaacat cgaaagtaca tgcatggaag aaatgttgaa atgtaatgaa ctaaatgatg 4560 tccttttctc cccttattaa acaacatcaa gtttctttta tttctaaaga atgttaatat 4620 cctttttatt tcttcaataa atagtactgc actccctatg gtttttgttg tttagcatct 4680 tgactttcgg gcatacgttt tatgatttat cttattacaa aatataatta tcatttattt 4740 tatcattaca aatactttaa aaataacatt atcagctgat tttgaattaa aactaaaatt 4800 acaccttaat tacaatatac ttcacatagc aattataata taactatata caacttacac 4860 tataagttat gttcaaaata tttttcctac aaaaactatc accagattct tagacagtcc 4920 cattccacca cctcagctgc cgtgaaagaa ctttgggtct taaataagtc caaatttatc 4980 tttttgtttt ctcaataaaa tattcgaatt atccaacaaa tcaaggaaaa aacatccttc 5040 gatgacccat gaatattcgt gaagtttctc ctctagccag taacaatacg gaacaatcag 5100 acaattttat ctggctcaag caccatctct cgcaccagat taaactattt ttttttcatg 5160 gtacaataca atcccatgcc ggccacgaaa aacaaatggc agaaataata aacgaacaaa 5220 acagcctctc tccatcgtga actaataaaa aataaaataa aaacaaaaca aaatgataat 5280 ggaattacga agcgcatggg aaaacgacgg gcacgattaa atcatggcgg ggagagcccg 5340 gaaccccact tccacacctc caaccccacg ccgtcagcct tcccctccca tgcacccggt 5400 ccaccaacac ctcatctctt ggaccccaca cgcagccact gcccacggca acgcggtgct 5460 cgtgcaccga gtccacacga cgcgccgcgc ggtgcggggg cgccggcctc tggggataaa 5520 tgggctaatc cggtagaaag cccaccactc gctcgccagt tcgtcgtcct cttcgccgag 5580 ctcgcgagct ctcgcactct gtctccatcc ccgcatcgca tcgcctcgcc gctgctgatc 5640 tcgtcgcggt cgccggaggg gagctacgag gttggggagc cttatctcta cttcctgaga 5700 tttctagtag ctttgtgtat gtgtgtgtgt ttgtgtgttg gggggacgcc gatcgggtgg 5760 atcctcctgt ggtggttggt tgggcgcaat tcgtgcttgg tttatttgct ggaattctag 5820 cgggggagct ggcgttgtcg gtgctaattg ctgcggggga gctgctggaa ttcgtgcttc 5880 tgcttgggaa ttagaaggtt tgggttttta tgattcagag ggctgtagag ctcttgagat 5940 tggctgcgaa aattcgggat ttgatcaact tagagagcat tatctttgga ttaggaggga 6000 tttttcttaa tttttcttag ttttttttga gctatcaaga gttcatgcca tcttatttct 6060 ccctttgttc ttagccggaa ggatacacga atcagttttt tttttttaaa aaaaatattt 6120 atctcaattt tctgcaagca tgttcaattt ctaagtggaa atgctattta aaagaccagg 6180 cttattgatt ggtgctatac tttgattttc tttggaattg tagtagaagc atcagtttct 6240 tcatgctgtc ctaccaacct ctcttattat tagcaaagta aagttattaa atttgctaat 6300 tgttgatatg tcagtatttt gtacgaattg tgaaatagtt aattttcaat aactacacac 6360 catggttgtc ctgttgttgg actggaagca ataagggaat attccatttc tgtccattaa 6420 aacccacaaa gatgaccctg tgctcatctc taccattgcc atgcacctgt ttgtaggatt 6480 gcctaaccca gaagttggtg cttcgagata gccatggcca ccggagtggc accagcgccg 6540 ctcccacatg tcagggtccg tgatggtggc atcggcttca cgaggagcgt cgactttgct 6600 aagatcttgt cggttcctgc tactctaagg gtgggctcat caagaggcag ggtgcttgtg 6660 gccaagagct caagtaccgg ttctgatacc atggagctcg agccatcttc agaaggaagc 6720 ccacttttag gtataactcg ccggctgttg ttcaccttgc atgtatattc gtgttagttg 6780 ttcttagtgc ttttaactga atgaacattt tttctgtaaa gaatctgaca gcatgtcttt 6840 tgcccttttg ttattcttta gttcccaggc aaaagtattg tgaatctata tatgagacaa 6900 ggaggagaaa aacccgcact gtg atg gtt ggg aat gtg cca ctt ggc agt gat 6953 Met Val Gly Asn Val Pro Leu Gly Ser Asp 1 5 10 cat ccc att agg att cag act atg acc acc tcg gat acc aag gat gtt 7001 His Pro Ile Arg Ile Gln Thr Met Thr Thr Ser Asp Thr Lys Asp Val 15 20 25 gct aaa acc gta gag gag gtacactcct atttgaagtt ctatgtttta 7049 Ala Lys Thr Val Glu Glu 30 gtttttaatt ctatgcttga ataattgaat gctgggcatg cattaatcat gtgttctttt 7109 agatgttcta tgtttcatga ctagtgaaat aacgaagtat agcactggtc cag gtt 7165 Val atg agg ata gca gat aaa ggg gct gat ttt gtt aga ata aca gtc cag 7213 Met Arg Ile Ala Asp Lys Gly Ala Asp Phe Val Arg Ile Thr Val Gln 35 40 45 ggt aga aag gaa gct gat gcc tgc ttt gag att aag aac act ctt gtt 7261 Gly Arg Lys Glu Ala Asp Ala Cys Phe Glu Ile Lys Asn Thr Leu Val 50 55 60 65 cag aag aa gtaagagtca tcatttttcc agattcagtg agttttcatg 7309 Gln Lys Asn aatgaattct catcttgctt ttgcatttca acag t tac aac atc ccc cta gtg 7362 Tyr Asn Ile Pro Leu Val 70 gct gat att cat ttt gcc ccg aca gtt gct tta aga gtg gct gaa tgc 7410 Ala Asp Ile His Phe Ala Pro Thr Val Ala Leu Arg Val Ala Glu Cys 75 80 85 90 ttt gac aaa att cgt gtc aac cca ggg aat ttt g gtgagtgaaa 7454 Phe Asp Lys Ile Arg Val Asn Pro Gly Asn Phe 95 100 taatgatgtg tatcatttta gtgtcaatat cttatcaact ctgtgcatat gctgagaact 7514 ctacttgcag ct gat cgc cgt gcc caa ttt gag cag ctt gaa tat act 7562 Ala Asp Arg Arg Ala Gln Phe Glu Gln Leu Glu Tyr Thr 105 110 gaa gat gat tat caa aaa gag ctt gag cat atc gag aag gtt cca aat 7610 Glu Asp Asp Tyr Gln Lys Glu Leu Glu His Ile Glu Lys Val Pro Asn 115 120 125 130 atc tca ctc ttt agt gtt aat tta gtcagtaaga atgtgcagta tgtttcctta 7664 Ile Ser Leu Phe Ser Val Asn Leu 135 cttgcatagc cacttccata tcatttcag gtc ttc tcc ccg ttg gtt gag aaa 7717 Val Phe Ser Pro Leu Val Glu Lys 140 145 tgc aag cag tat gga aga gca atg cgt ata gga aca aat cat gga agt 7765 Cys Lys Gln Tyr Gly Arg Ala Met Arg Ile Gly Thr Asn His Gly Ser 150 155 160 ctg tct gac cgc ata atg agt tac tat ggt gat tct cca cgc gga atg 7813 Leu Ser Asp Arg Ile Met Ser Tyr Tyr Gly Asp Ser Pro Arg Gly Met 165 170 175 gtattatttc ctttctgggg atttcattca aataactttt cgtttcatgg atgtcttcaa 7873 ttaatgatcg ttttgataga tgaatgacat gttctacaaa taatttcag gtt gag tct 7931 Val Glu Ser 180 gct ttg gaa ttt gcc agg atc tgt cgg aag ctg gac ttc cat aac ttt 7979 Ala Leu Glu Phe Ala Arg Ile Cys Arg Lys Leu Asp Phe His Asn Phe 185 190 195 gtg ttt tca atg aaa gca agt aac cct gtt atc atg gtc caa gca tat 8027 Val Phe Ser Met Lys Ala Ser Asn Pro Val Ile Met Val Gln Ala Tyr 200 205 210 cgc ttg ctt gta gca gaa atg tat aac cta ggg tgg gat tat cct ttg 8075 Arg Leu Leu Val Ala Glu Met Tyr Asn Leu Gly Trp Asp Tyr Pro Leu 215 220 225 cac ttg gga gtt aca gaa gct gga gag ggt gaa gat ggg agg atg aag 8123 His Leu Gly Val Thr Glu Ala Gly Glu Gly Glu Asp Gly Arg Met Lys 230 235 240 245 tct gcc att ggc att gga aca ctt ctg atg gtaattgcat ttttactttg 8173 Ser Ala Ile Gly Ile Gly Thr Leu Leu Met 250 255 tgtattatat tgcatatatc atatctttcc atctgcaaag ggtaagcatg ccttatgtct 8233 tccttttgtt gtcttacag gat ggc ttg ggc gat aca atc cgt gtc tcc ctc 8285 Asp Gly Leu Gly Asp Thr Ile Arg Val Ser Leu 260 265 acg gaa cca cct gaa gaa gag att gat cct tgc cgg aga ttg gca aat 8333 Thr Glu Pro Pro Glu Glu Glu Ile Asp Pro Cys Arg Arg Leu Ala Asn 270 275 280 ctt ggc aca cat gcc gca gac ctt caa ata gga gtg gtaacgattt 8379 Leu Gly Thr His Ala Ala Asp Leu Gln Ile Gly Val 285 290 attacctttc tctagtttta cacttttctc ttgtttagct gccaatgcca cacattaatt 8439 ttgactattt ttagtagtgt tttgttctat ttgttctttt aagaatttct atttatatac 8499 attatatgtt ctcag gct cct ttt gaa gaa aag cac agg cgc tat ttt gat 8550 Ala Pro Phe Glu Glu Lys His Arg Arg Tyr Phe Asp 295 300 305 ttc cag cgt aga agt ggt cag ttg cct tta caa aag gag gttagttcaa 8599 Phe Gln Arg Arg Ser Gly Gln Leu Pro Leu Gln Lys Glu 310 315 aataactcct atagtccata gttatcataa aaacaatagt gctagatttc ttattagttg 8659 cacttatgac agggtgagga agtagactac agaggggtct tgcaccgtga tggctctgtt 8719 ttgatgtcag tttccttgga tcagttgaag gtaactcaca tatttgttac ccttttgtgc 8779 aatgtgttga tcttgtgtaa ctttaccaaa atatatttca agacaatagt ctattttgta 8839 atatacaatt ctacaacatg atattttcag tagccatgtt ccatgcattc tatgcatagt 8899 tcatagtaca tagtgagaat agcaatagca aaaagaaggc attgattttt ttctatctga 8959 atcaaatcaa ttgatgcatt ttgtaatgat ggaaggctct cttatttttc ag gct cct 9017 Ala Pro 320 gag ctc ctt tat agg tct ctt gct gca aag ctt gtg gtt ggc atg cct 9065 Glu Leu Leu Tyr Arg Ser Leu Ala Ala Lys Leu Val Val Gly Met Pro 325 330 335 ttc aag gtctgatcct tatagctgta cattctagca aacaactaaa ctttattggt 9121 Phe Lys acttcagtct aaactgatgt taatttttct atgaatatca g gat ctg gca act gta 9177 Asp Leu Ala Thr Val 340 gat tct att ctt ttg aag gag ctc cca cct gta gaa gat gct caa gct 9225 Asp Ser Ile Leu Leu Lys Glu Leu Pro Pro Val Glu Asp Ala Gln Ala 345 350 355 360 gtgagttcct tcaacattat ttgttctttt cacaaatcac aagcttatat taacattcta 9285 ttcctttaaa atttttgtgt tgaaatctgt aaaatggtac ag agg ctt gca ctc 9339 Arg Leu Ala Leu aaa aga tta gtt gac atc agc atg ggt gtg ttg act ccc tta tca gag 9387 Lys Arg Leu Val Asp Ile Ser Met Gly Val Leu Thr Pro Leu Ser Glu 365 370 375 380 caa ctg aca aag cca ctc cca cat gca att gct ctt gtc aat gtg gat 9435 Gln Leu Thr Lys Pro Leu Pro His Ala Ile Ala Leu Val Asn Val Asp 385 390 395 gaa ctg tca agc ggt gca cac aaa ctt ttg cca gaa g gtagacattt 9482 Glu Leu Ser Ser Gly Ala His Lys Leu Leu Pro Glu 400 405 gaatttgata atgatctttg ttgttttgtg aattgtgttt atgtcatttt ctgtatttta 9542 acattttgct tagtctgttt tattgatgaa tctttttttt atgtag gc act aga 9596 Gly Thr Arg 410 ttg gct gtc acc ctt cgt gga gat gaa tca tat gaa cag cta gat ctt 9644 Leu Ala Val Thr Leu Arg Gly Asp Glu Ser Tyr Glu Gln Leu Asp Leu 415 420 425 ctt aag ggt gtt gat gat ata aca atg tta ctg cac agt gtt cct tat 9692 Leu Lys Gly Val Asp Asp Ile Thr Met Leu Leu His Ser Val Pro Tyr 430 435 440 ggt gaa gag aag act ggc aga gta cac gct gct agg ag gtaagtgaac 9740 Gly Glu Glu Lys Thr Gly Arg Val His Ala Ala Arg Arg 445 450 455 acagtaggcc agttaatacc actccctcca ttattaccat ttgttgggat gaaccgatag 9800 tcaattctaa gttacacatt aagcatgaaa aatgaaaatg gatttgactc tgcagaaaac 9860 tgacatacag accaatgttt ccacctggtt ttccattgtt ctgtacttct ctttacctaa 9920 aattttattt tttttaataa tgttttgcag g tta ttt gag tac tta gaa acc 9972 Leu Phe Glu Tyr Leu Glu Thr 460 aac ggt ttg aac ttc cct gta atc cat cac ata gaa ttc ccc aaa agc 10020 Asn Gly Leu Asn Phe Pro Val Ile His His Ile Glu Phe Pro Lys Ser 465 470 475 gtg aac ag gtactatgaa gtgcttatta agagatgcat tgaccgccca 10068 Val Asn Arg 480 tccttacccc ttgaaattac tgtaccttta ttctcttgtg cttatttgag ttaaattata 10128 tgcag a gat gac ctt gtt att ggt gct ggg gca aat gtt ggt gct ctt 10176 Asp Asp Leu Val Ile Gly Ala Gly Ala Asn Val Gly Ala Leu 485 490 495 cta gtt gat ggt ctt ggt gat ggt gta ctt ctt gaa gct gct gac cag 10224 Leu Val Asp Gly Leu Gly Asp Gly Val Leu Leu Glu Ala Ala Asp Gln 500 505 510 gaa ttt gag ttt ttg agg gac aca tcc ttc aac ttg tta cag ggc tgc 10272 Glu Phe Glu Phe Leu Arg Asp Thr Ser Phe Asn Leu Leu Gln Gly Cys 515 520 525 agg atg cgc aac aca aaa acg gtaagctgat gaattcttct ctgttagact 10323 Arg Met Arg Asn Thr Lys Thr 530 535 gtagatccca tgaacaacgt caacctttaa ctcgtgagat atcatgaaga agtgcaaaat 10383 tgcactttta acagtaaatg aaccttatag cctaccgaag aggataaata actttaggca 10443 attctctctt gtgaagcaga acattctttt ggcgatttct gaccgttaat taatgctgca 10503 ggaatatgtc tcttgtcctt cttgtgggcg gacactcttt gacctccaag aagtcagtgc 10563 tcagattaga gagaagacct ctcatctgcc aggcgtctct gtaaactctc ttacagacct 10623 tctgcctccc ttgttttcaa tcgcatatta gctagcctga tggctaatca tgtctacatt 10683 tgcctggcag att gct atc atg ggt tgc att gtc aat ggg cca ggg gag 10732 Ile Ala Ile Met Gly Cys Ile Val Asn Gly Pro Gly Glu 540 545 atg gcc gat gct gat ttc gga tac gtt gga ggt gct cct ggg aag atc 10780 Met Ala Asp Ala Asp Phe Gly Tyr Val Gly Gly Ala Pro Gly Lys Ile 550 555 560 gac ctt tat gtt ggc aag gtaacctttt cctatacttg tggaagttga 10828 Asp Leu Tyr Val Gly Lys 565 570 atcatatcaa atggaataat ggaaatcacg gtatatcgtt gaacatagct gcaagtcaat 10888 atttgtacat gatcatgcaa acacaatcaa cagtagggat gttaactgca tggcatatat 10948 atgctctttg agctgaaaca aaaacttaga gctgccattt tccttccatt aacacaagtt 11008 ctacttgttt tgggtgcag acc gtc gtg caa cgg ggc att gca atg gag ggg 11060 Thr Val Val Gln Arg Gly Ile Ala Met Glu Gly 575 580 gcc act gac gcc ttg att cag tta atc aag gac cat ggc cgt tgg gtg 11108 Ala Thr Asp Ala Leu Ile Gln Leu Ile Lys Asp His Gly Arg Trp Val 585 590 595 gat cct cct gtt gag gag tag gccgtagcat gtagttcata tatgtactcc 11159 Asp Pro Pro Val Glu Glu 600 tccataaaca atgttgtagc tgaggcacat tgtattgtat ccacggagta cataaataca 11219 cgttctgtac atcagtttag aaataaagta ggaatagggg tggctgcaac tttgtaacac 11279 cctcgtgaag catcggcaaa tccaaattag aagcgtcctg aaatcagtga aaaagaattg 11339 atactgctat tttttgtacc aattgaaaaa aaaaaggaat acatgatatg actaaatcat 11399 gggttacatc ttcgtcaaaa aatgtcacag cttacattat tttcactact tgcaaatacc 11459 agacgatcta ctggtgcggg aacttgacgg gtgcaggaga cgcgaagccc ttgtggtaga 11519 gaagctcggc catgacgctg tacgcgcgct gagtcaggtg gacgccacgc catcccagct 11579 gatctgctcc atctcgaagt tgtacttccc gccgcagcac gccttggtca gcgccacgcc 11639 gtcgaacccc gtgtcgcgcg cgccctccag catccgcacg tacgcgccgg agtagtcggc 11699 gtacgcgatc gtggcctccg gttatgaccg cctcagctcc cggatcccct gctgcagcag 11759 cacgttgtgc atctgcgcga acaggttgag acccacgagg cacccgttcc cgtcgtacgc 11819 cgcgcgctcc gtctcgtcca ccgccgccag gtagctcggc gcgcaaccca gcgggaagtt 11879 gcccgggatc accacccgcg tcgcgctcat ctcgagcacc tccctcgccg cgctcaccac 11939 gcgaccgcac cacctctggt acgagcacca ccgactccac cacgccggtc atcatgcgcc 11999 cgacgtccgc gcggcgctac gacctcctgt tcgcggcctg ttcgcggcga tctctcccac 12059 catcaccagc aagctcgcgc cagcttctct tgctcgagaa ttttcagaat atgccaccga 12119 atatgcaccg ttttcaggat agaccactca attcgcacta ctttcataat atggcatttg 12179 gacgcgatat tttcttcgtt ccgtgacact ctcatccttc caccgtcagc gccagtaatt 12239 ccgttcgcac accaacagct ctctctgagc gtccagctcc agtgggggag ttgttggtgc 12299 gcggcgcggt aacaccgatc ctcgcgaggg ccgccgcgtc gagggcggtg gcgccggtga 12359 cggcgaaagt tgacaccgta ggagaagtcg gcgcctttgt cgatgtacgg gttgagcagc 12419 ggcagcccta ggtcgttggc gaggtagtcg atcatgaggt acccgtcgtc ggagcactgc 12479 cccgtggcgc tgccgatggc cgcgccgtac gtagggaggc gccacggtgt gctccatcaa 12539 ggcgaggaag ttgccggtgt ccgagatgga gtccccgaag ttgtagatgt ccgtgatgcc 12599 gtccaccacc gcccccttcg ccgccgatga caacgacgac gacgcggcct tccccggagc 12659 cggccttgcc tggcaagtgc cgacgaggag cagcgccaag aacgcgacga ggattggatg 12719 aaccggccta ctcgccatgg cgctcggtgc aagtgcaagt gggtgcgacg cagcagttgt 12779 tgtggcatgg cgcgcgcgcg gtgtggaatt cgattggaaa cgatttaagc tgagacatag 12839 tccaactccg aaacccaaat taaccataca tacagtgata caggtgaatc gacgagatga 12899 tcatgcacta cttaaaaaaa accgtcaaaa cacatttttg taggcggtca aatactctat 12959 gtacttaaag gcctgcgaaa ataacgcccc aaaagtcgtt tcttagtagt gatgcatacg 13019 caattgctgc aataacttaa aaagggtgat ttttattgca tcaacgtaac acgtacactg 13079 cattagtcct cctacattga aagcacaaat taaaccagta tggttgcaac ttgagacaca 13139 caaaggtgat cgatcgagaa ggttagctat aaacagcacc ccaaatggca cgaattaata 13199 atgtagttct ttctgcatgc tgaccaaaat ttcattttct ttttctctcc cctcgtcatt 13259 aaaaaaaagg tttaaagaca gaattacaag ctaattaatc atcagtggat cgagaattaa 13319 ttaagggatc acaatggctg caccccgcta tttcggagta gctagctcca tgcactcact 13379 catgcatgca ggcatgcata tacatgtccc ttgccatgtc ctatctaaca atttacacat 13439 ttcgacaaaa tgctcacggt cgatttggat tgtgtcactg acattaattg gttcatgcat 13499 ccacgcatgc gttactctca aggaaatatg aaagtatcat ccgtaatcag ggttccaaac 13559 taaggataga tacctttcan nnnnnnnnnn nnnnnnnnnn nnnnnnnnag gcctgctgca 13619 gcaagtgcac ttctcctgct catgcttcag agcctgcacg cagaaagacg acacaaaatt 13679 caaaagttta tatcgcttct gttttggagc ctcggctaaa aaatgaaaat atgaacaacc 13739 aaaaaaggca acacgtacga gttctaacca agtatataac cattataatg gcaaatgtga 13799 tctatacttt tgtagacgaa gacaattaat gatagtacca gtgaatatgc tagctatata 13859 cttttatcaa ctacttatcc gatcaatatg cttcagcatt acaaactagt tcttatatat 13919 atatttcttc tatcttattt catctctaaa atacaaagtt tatagtgtaa agagatcccc 13979 agggatgaat atatcttcta acacacctcg tagttaattt gttccaaaca atactagcat 14039 gcatataatt tgtagttatt tgtagcaaag cacggctatt tcgctaacaa atctaaatag 14099 aaaatatgtt atctctcagc cttgagaggt gtattaatta ccagcccata catcacttga 14159 gagggaaaag atttaaataa gacaaattga ttagaacaaa agggaatgat agacaatgtc 14219 ggtttttttt cgtttcttcc tttccttcgc ataggctcgt ctagctggtt gcgttatgta 14279 acaaaacctc ttttcctttt aatatattga tgggcgcgcc ttttgcgcat tcacgaaaaa 14339 aaatgtaaat gtgaattttc aatcttatcc cctacttgcg ggattagtcc ttgtgaagaa 14399 atcctcaaat atgcgtacct gcagctggct ctgcagaccc ttgatgtgct caactgcaag 14459 gtccaacatg tccgctgtgc ttgtttgctg ttgcaacacg aacataatta attactcaat 14519 tggttgcatt attcatgcgc aaaaaatgtt accgctaatt aatattagct agaactagat 14579 gagagaacgt acgacccctt tcatctatat acaataatca tgaatttgtt gagaaagcat 14639 gtttggtatg gtgttggagt tgtggctgtc atgcaccaaa gctctaatct cagtgcctat 14699 agaatttaac tacacaaaca tggatacgct ttttctagaa attctattag gttatgattt 14759 tgcgcttggt gtccatgaat ttgttgagca tgtgttaagg gacacttcac agtgcacact 14819 catgggtgaa tgcgtgtgca tttgccatgt ctattattaa ggcgagaaac atgaatctgt 14879 gtgctaatgg cacaagaaat gtggaaagtt tttttttaaa agaaaatact tagctaggga 14939 tgttcctttc ttcctcaaat atcatgtaaa tataggtatg aacattatgc aaagttcaaa 14999 tcgtaatggc caccttgtcc atgttgggca ccagctcctg cagcttcctg agcttctcgc 15059 taattctcgt cctccgttcc tacggacgcg catcgatcac accgacgtac atgctcatgt 15119 gtcaagatct gaagagaaag caaaagcaaa tatagaggcg ttttgatcat gatattgcgt 15179 acgtaccctc tccgcgatgc tcctggggtg cgtcgcgcag ccgcgcttgg cccgcacttt 15239 gaacggcacc tggtcatgct gcagctgcag gtacctgtcc atgccggcca tctccagcgc 15299 cgacgtgctc gccatgccgc cgaactgccc ccatttccaa cacgcccaag aaatcagaac 15359 acatcgcgat atatatatat atatatatat atatatatat atatatatat atatatatat 15419 cacaaacaca gcaaagctag ctactacttc ttcctctgtt ttacattaat tattataagt 15479 tgttttgagt tttgaataga ttcatacatg tataaatgta tgtgtttcat acatgtgtcc 15539 aaattcttat gaatgttagt aaatataaac aagggatgaa gagatcaaga agccttgtag 15599 tgtacaatga ttcaatgaag gtagccctag caatcaaatt tgccgagcaa tctttacctg 15659 ggactcgtac ccgccgaggg tggagatgat gtccctggac tcctcccacg gcccgacgat 15719 ggagaacccg ccgccgccgc tgctgccgcc ggcggagaag gtgcggggca cggaggcctc 15779 ggcgccggcg cggtcgggga aggcgccgtc ctccgcgatg tgcgagaggt gcggcggccc 15839 ggccgtgaag ctcagctggg acttcatctt cctcccgccg ctgctgctgc cgctgccgct 15899 gccggccatg gaagggtggt ggtgggcttc ggctccgctg cctcccccgc cttttgagcc 15959 tggaaagcct gcttagttta ttgccaagta gcaagcacgg aaattaacta atgatcgcta 16019 attagttaaa ttaactgtgt gtgtgagaga aagagctact gttacccaaa cgctagttga 16079 aaactgccaa gtgtgacaag taaacaatag tttacggtat tagcataccg ttagagctag 16139 ctctataggt acacgtgttg agcaataagt ttaacctaga tgtgatggga tgttcaaact 16199 tgcttctcca aggttgaatg gagtagtgtg tatttgattc tacaatattt ttctgtagta 16259 ggtgcacgta attaaggtta ggtttgattc tcatgttcaa atgtgtgttt aactgcaggt 16319 gtaatgttat atatgcatag tggttctata aatattttca taattaaaca ctaccaaatt 16379 tctatttgaa atccatgtac aaattaaact tgactaatca ccggttatta tagttaaaca 16439 taacttaaac cacaacaatt accattcatc aacactatgc actactaact aattaaaaaa 16499 aattacaagc tagcactacg aaattaaaag tggcccggcc gagttgcccc agcacaaaat 16559 agcacgatag atacaggata tacttcctcc gtttctaaat attttacacc gttaactttt 16619 tagcacatgt ttgaccattc atcttattca aaaatttttg tgaaatatat aaaactatat 16679 gtatacataa aagtatattt aacaatgaat caaatgatag gaaaaaaata atacttattt 16739 aaaatttttg aataagacga acggtcaaac atgtttaaaa aagtcaacgg catcgaatat 16799 ttagaaacgg agggagtata tgagaggaat attctcgtga ctagaaccat atgttccaga 16859 aagttgtact ccatccattt taaaatgtaa ggtctatttt gagtggtcac aagtattaag 16919 aatatgaaac ttacagaaag atgagttcaa acgaccacct taattagaaa gagtagtaga 16979 tcgttagtga gacgaatatt atatatatga aagagacaaa aacaattaaa attagtgttt 17039 gcatttgcgt tcatctttac tagctattac tagttactta taagcacatc gtcaaacatg 17099 tacttacgtg ttgcaactta atttctactc cctccaattc agtattggtc gttttggatg 17159 aaaataatat caaagttagc aatccggccg taaccatttt ttcaaacctt gtatgcccaa 17219 tagttacatc gctattcaaa tcaaaggttt caaattttgg attactattg ggtcccaata 17279 gaagcccaaa aagtatttga attttttaac ttaggccccg tttagttccc taaatttttt 17339 ttcaaaaaac atcacatcga atttgtgaac acatgcatga agcattaaat atagataaga 17399 gataatccct catatgccac taaaaattga tctgatccct tatatgccac taaaaattgg 17459 ctcctccctt atatgccatt ggtctaaatt tgcgtaccct ctcatgtcac taccgtcagt 17519 tgaccgtgtg ttgaccgtta actctcaagt aaaaaagaca tattgccctc tctgagttgt 17579 taggcatgcc ctatactcag aagggtaaat acgtcttttt tccttaagaa ttaacggtca 17639 acacatgtca actgacggga gtggcgtgag agagtgtgca aatttggatc aatggcatat 17699 aagggaagaa ctaattgtca atggcatata agggattaga ccaactttcg gtggtatata 17759 agggattctc tctataaata aatgaaaaat ctaattgcac agttagggag gaaatcgcga 17819 gacgaatctt ttgagcctaa ttaatccatg attagccata agtgctacag taacccacat 17879 gtgctaatga cggattaatt aggcttaaaa gattcgtctc gcagtttcca tgcaagttat 17939 gaaattattt ttttcattcg tatctgaaaa acccttccga catccggtca aacatccgat 17999 atgacaccca aaatgtttct tttcgcaaac taaacaggcc cttagcaaaa tggttggtta 18059 tcaactttta aaatatgttg acagtgtctg tgacgacttc atgacggtcc tctttaaagg 18119 tgcttatata gtgatagggt gtgcgtgtat gttcagagcg ttgagtatgc atgtgtatat 18179 atgcatgttt gtgtctgtac tgtgttaaaa aagaaaatcc caagatctag cctaaaattt 18239 tcattaaaaa cattgaaatt ttggccccac gattttttta ttccacaatg taaatttcta 18299 gtcaaattgc tgcgaatgac gcgaaaatta ttttctgacc agtgaactga catgcacaca 18359 ttacactata tttattttat atttattttg aacgtaccta cgactacttc caggggatcg 18419 atcttattct cctcaaatta ataagaacaa gtactctctc catttcaaaa tacaacaacc 18479 taagaatatg gataattttc ttcattgaat cggatggttt cttcggtttt tttgtactac 18539 gatgcgaaca gatggtatat tgaagcctac cggacacgct agcacgtgca tgccgcgtgc 18599 cggcccgtgc atatgagcaa gcctcgcacg ctgacataga cgcagccaag agagaaagca 18659 aacgccaaat caagaagccg agcaatcacg catgccatct caacgcaccg taggtcacta 18719 tctttagcga ggcaagaccg tgacgtcacc gtcaggccat cagcagagga gctgaacctg 18779 gacaaaccgg ggggcccacc ccgcaggcca agttgcggcg acacacacgt ggtccccgcc 18839 ttacattaag gcaagtggcg ccctaattaa tccattgatc aaaaattaat taatccacaa 18899 attaatcaaa tgccctcatc tttttctttt tgccttggct agggttcgag gcactaagat 18959 ccactggtaa tttaattgtg cttgctgtct tgatactaat taattgatca tatatgcgca 19019 agttggtcta tctagagcag aatctagagt gcaactggct gccgcattga aagaaatgct 19079 gctacatggg ctccactgaa agacatttga ctcttttaaa ctttactcga ggctattcct 19139 acctcgatca aagtataatt actaaattta gtactggtgt agtacttata tgtggatttc 19199 gacatttcta ctggtactat ttttatcctt accaattgtt gtatacaggt tgctcggtca 19259 aaaggccatt ttagatgttg gtatatatgt agtgtgaaaa ttaattataa cataactcta 19319 tgttcatatt gatctgcatt tcaaaaagat attgacacac ttattcctaa tttttgaata 19379 aatgatattt tgaagttttc attaaagggt tattatctct gtatgctcta aaacgttgaa 19439 tatttgtgac gcagaattaa tttaatactc atggaataaa taatgatggt gcataatttt 19499 gcaatgattt tcatcaaatg aggtgcatat aggtatcctt tatatgaaat gagaatactt 19559 gccaaaaaca tttttaaaaa gagcttgttt tagctagcta ggttggtgaa tggtgatact 19619 aattaatcaa atgtacatat ttgtgcaaat cctggaagat gaatgcatgg ttttctagtc 19679 ttattatgaa caaattaaat tagaaaaaaa aacatctatc tctttgctct ctccactata 19739 gcttcaaatt gttttttttc cccatgtcta ctattgtagt gaagaatgga ttgtcatgcg 19799 caatgacttt gcaactgaaa ataatggatc aaatgagaga gagggacacc aggtgcaagt 19859 ggcaaaaaaa ctaagccatt tatagcaagt tgcaatagaa aataagacaa tctagagaca 19919 ctcgattata aaaagcgtac gtaaaaagaa taaaagcggt gtattcaaaa ccctagaccc 19979 cacatttcac tatcgatgat accctacttg agaaaacccg cctcctgtgt agcccatagt 20039 tttccatcgt ccttcttaca cgccgagcca aatttgtgca ctcctcgtaa taacatatgc 20099 cttaaaaact tgaactcata ttacattatc acgaaaacaa ttaagccgca taatctcatg 20159 gatataacat ctcatggtgg atccttaatt aacagcttat atatatatat atatatatat 20219 atatatatat atatatatat atatatatat atatatatat tgaccctaac tgtggcaaac 20279 atgcattatt atcacacaaa agttactaac cacatatagg agcctatggc taatggctct 20339 gagtagaaaa atgggcacag aggatctcca tgatactatt tatggcaact cacgtagcaa 20399 aaagccgcag actaacacat ccatggatat ccacaacgca tactgatagt agtctgatat 20459 acacactagc tcctcccatg acggccttag cgaaaaccac tttttaaccc aaaaaaaaaa 20519 ccagttagga ccggtgaaaa gtcgcacgcg atgatcgatt cacgcgcgcg ccgcagaagc 20579 aacttgcaaa agggatcgag cttagctaga tagcgcgagc tcatcagcat ttcgtcgtcg 20639 ccgagcgagc tagtggcttt ggcagttagt agtgatggga gttgcataga agttaagaac 20699 caggtagaca gagatcgatc gattgatcaa acccgtttgg tttcggataa gtatgggaag 20759 aatctgaaac agtgtggagg aaacactgag agagaaagaa caccattaac aataatatcg 20819 atggaattcg ttttttttgg tggttgttgc tagaagccta gaacagcaat tcatgtgatc 20879 gatcgatact tcgatcgtgt gcgtgtgtga cgagaaagag atggggcatg tgaaggcaaa 20939 gacgaggttg acatttgcac agctagccgt tctctcctga cagaattaag ctagaaattg 20999 aagatccgtg actctgagta gtcctaacca attagctata cgcctataca cgatgggcta 21059 gctatgcacg cacgcgacgc caaattgaac acggatgaac aaataaaatc gaacaatggg 21119 ttggctagcg caatcgatcg atcgatctta ccgttgctgg ccatgaggtt ggagaagaat 21179 ccggccggcg agctgctgtg ccgcgccagc aagtccacgc tcccgtcctg cagctgatgc 21239 ccatgccctc cgccccctcc gccgccgccg ccgccgccgc cgtgcgggcc cagcgagatg 21299 tcccccccac cgaaccgcag ccccgccgcc tccgcctccc ttggctgcgg cgtcgtcgac 21359 gacgacgacg gctccacccc tcctcctcct cctcccccca ccggcagaaa cctcctcatc 21419 atacccctcc ttggatcgat cgatcaactc caccccccgc gaccgagacg cggcctctcg 21479 tcgatcgatc tgcagctcgc gcaggcgcag gtaggcaggc ggcgcgtggt gtgggtggaa 21539 atttcggcgt gaaaattaac aaaacgacgg gggcggccta tactatagct agtagaggag 21599 agaagagggg aagggaaggg aagggggagg tgaggtggtg gaggtggtgg ggctaggcgc 21659 aagtgggagg agagggtggt gggattttaa agggaagcga ggccccgtga ttggttctcg 21719 gggcgtgtgg cgccgtgggg accagcggac cggccgggcc cgggcaagtg gatgtctcgc 21779 gcggagtgga gtgggcttct gcactgcgca gcagcagcag tagcaagccg taggtggcgt 21839 cgcgcgcccc gccccggaac cggcaggcat ctctctcggc ttttcgctgc atctttggtg 21899 ctagattttt gtgttggata tatgatgctg atcgaggaaa gggaaggaag aagaaaaaaa 21959 aaaggatttt tttggtgtgg cttagatttt tggatgcttt ctttcctctg ctgcggactg 22019 cggggactag aggatgaact cgataatcaa tggtggtggc ggcaaatgtt tatacttcct 22079 cagtctttta tatttacctt ttgtgatatg gaggaaacaa gctggtttgt ggtgttgtgc 22139 actcgtagga ggggaggtac gtagttaacg gcaaagatcg atcatgcaag ttggttgggt 22199 caatttggtg gtcgagctga cctatgttcg cccatcctct cgatactttt ctcatctaga 22259 ctttttctac gacgctaaca gactgattat cacagtcatt ggatagatcg acatggtcat 22319 ttgaaattgt tcgatataac tggtttaagt tcaaacaaat ccaagctaaa ttttattttg 22379 cggaaaaaat gtttgaattt cacttgtttt caaccgttat tgctgttagc gaccttgccg 22439 ttagggagcg gttttttaac ctcggcacat ccgtaaactc tattgcaggg gagtcatgtg 22499 tatgtctaac agtagtataa ttttatcaca atgatttgtc tctttacgag ttgtattata 22559 aactcacggt gttccccgca aaaaataaaa aataaactca cggtatgtgt aaatggaatt 22619 aggtcaaaat ttaggaatga aatgaataat caattgggtg tgaatgggtc aatgcactaa 22679 accatatgtt ttgctcacta gatatgacaa ggaaaaaccg aaccatcaat aacactggaa 22739 accatgtttt tgtggtgacg cttagttaac tcatacatca attataatct tttctctatc 22799 caattccact ttggtctatt ttgtctattt gaaatcatgt ttcagctatc ttctaagtaa 22859 agcaaacttg aaaacctagt acatctaaac ctagctccac tagtgtggtc caaaagcagt 22919 ggagtaatat cataagagga agacaacaaa aaataggata gagatagtct tagcttgtgc 22979 cgcagtaatt cattcgatag attattagta attcattcga taagatatga taatgatgaa 23039 atgattcggt cgcacggtgg ttgtgaagtc ggagccatga tatgtggcat cgaaagcatt 23099 agtcaaacgg acttcggttt tggtcaggta aagttgtgtt ccttggttct taattcttat 23159 caaaattgga gtcgcctgat catgtgtgcg gtggtgtgat gacgaatgac ggcgagtttt 23219 taccaatgta cagtgaactg cgttttgttt taaggctgtt agttttgttg tcgtggttat 23279 gctttgctag ctagttaggg tgattcctat tttttgtcag gtcttatgaa agttaaaaat 23339 attttttaaa atatgagttt ttttattgtg aataatgagg aacaaatgaa gttttgggag 23399 gatacatggc tagaaaacat ggcttttaaa gataaatatc catctttata ttatatagtt 23459 cgaaggaaaa atttatctat tgctaatgcc atgggatctg ttccgcttaa tgtttctttt 23519 agaagagttt tagttggtca gaatcttgta tattggcatg aattgcgtgc ttctattgta 23579 catattcagt tgaatcaatc tactgactat tttagatgga attatcatca aaatggttta 23639 ttttctgtaa ggtcaatgta tctagcctta agccttaatt aataatggtt acattgagag 23699 aaataagatt atttagaaac ttaagatgcc gcttaaaatt aagattttta tgtggtactt 23759 gcttaaaggg gttatgttaa caaagacaat ttggcaaaac ggaattggaa tggcagctta 23819 agatgttgtt tatgtatgaa aaatgagact attcaatatc tttttataga ttgtcatttt 23879 gcaaaatttg tttggggagc gtttcagtac tcttttggtt tataccttcc tactttcata 23939 cattgtatgt ttgatggttg gctttggggg tgaacaagaa aaggagcaaa ctaattcttg 23999 tagaagcttg tactatatgt taggctctgt gattgagtag gaatgatatg atttttgaca 24059 aatcactatc tatttcattt atgcaggcat tcttcagagc aacatattgg ctccggtttt 24119 gggcacaacc gtaaaagtgt gatgaagatg gagagctttt gaaagttata tgtcgtaagc 24179 ttgagacgac ggttatgcaa ctttttgcca actatggatg gagattcaca aatagactta 24239 aataattgtg tgctccttat attggtctgg tcattttttt tatgtttagg tgtgtgttta 24299 attctatttg aactacactc tttgttaagt gctagattgt aataattggc tgtagctctg 24359 ttgagcaaag gccgagatgt tatctattcc attattaaaa aaaagctagt tagtgtattt 24419 attgttgtat ggcggtttta gcccgattgt tctaaatcaa ctgaatatta atttgctctt 24479 ttttagagaa acacccagag gtcttccggc tgggttagat gaccttggtc cttatccctt 24539 ctaattattt gatattaggt acttcactaa tattcgtatc ttttttaaat taatttgctc 24599 tctttttaaa ctattcatct tttctttaat atagcactaa attaaccgtg atctttcaaa 24659 agaaaaggcg aaaggtgtga atatgcatga aagatcgagt ggacaccccc caaaaaaaaa 24719 aaaccctagt tgttgtcacg tgactctcaa agtccatttg aggacttact aactgtttga 24779 aattaatgga taaggctcca gctaagtagg cgggaaaaga tcaaacgtgt tcagtggatt 24839 tataccaaat gtggtccgtg cgacatgttg gtccataaaa gggcatatga aagtttcctt 24899 tcagctaatt aaagccagtg tcgaatactt atacagtata gttttcgaaa taagttttac 24959 ttctacaatg taatccattc acggatgaaa aagctgtgcg cccaacagct atagctatac 25019 aactatatct atttgttaat taagaggttc atatcttggg cacacaaagg ttctgtttga 25079 atcttctgaa gataaatatg aagatcaaat gttttacgta aaacgaggtg gtaataactt 25139 ttgattaatt agattttaat tattacaaac ttaaaaaaaa gattaatctg atattttata 25199 acaactttca tatagaaaat tttcacacga aacgcaccgt ttaacagttt gaaaagcgtg 25259 ccacgaaaat ctagaactta atctgccctt tgttgggttc tcgaacagga ccaaacttca 25319 tgtccatact ccgtactgta cataccaact atactaaata tcgctaaaac gttttaaaaa 25379 tattatacat atactttcaa tactattata cgtatgcgta aagttttatc ctcaaattca 25439 ttatatttca tacttaaaaa aaattctaat agctttatga atataagtct tagatttttt 25499 tctccatata tatatatgat aaatttaaag atgggacttc acgcgtatat ataaatacta 25559 tttaaagtac atgtacattt ttctaaaaaa ataatatttg ttagtttgta tacattgtgt 25619 gtatacgtga aggctcacgt agacattttg cactctcaat tatttatact agactaataa 25679 ccacctaaat attgttctta gcggttttga cttgagctta cctacgagat gccaacgtgt 25739 cagtccagtc agcaaaaaag ttttaaaaaa actccgtggg cccacttgtc atacttctcc 25799 ctcaatctaa cgcctccccg gtcaccctac tctctcttcc tcgtgcgcac gctcgtgcgg 25859 ccaacggcgg agtggtgcgg tgttggtgtt ggtggtgcgt ccgcgtcggg cacgacggcg 25919 gtgttgccgg agagatggag ctacgcaaga ggcgggcgtc gacaagtatg gaccccagga 25979 ggtggaggct acgcgcgagc gcgtcggccg cgccctttgc cttcaacggc gacgacagca 26039 ttggccgctc gttctcggcc tcgcctctct gaccgcaggt cggcctcagc ctagcagtag 26099 tcggccacgc acgttggcct cattttcgtc accgtgttct tggggctggc atgcaggcgg 26159 gagaaggagg gatagcggca tggactgcgt gtgcgcctgt gcagtgacct gggtggatac 26219 tatgtcaagt tggagctctg caagtcggcg ctctgcgacg gcgacggcaa cagggacaca 26279 tcgtcgttgt caccgtgctg cgcggcacga gaaagatggt ggcactgacg gtgaggcttg 26339 cgatgacaaa tgtgatggtg acaagggccc aacgtcgagg tcgttgccct ggaatagaat 26399 ccgatcggca gctctagcgg tggcaacggc tcagtgctgg cgctagagta cgagcgcggt 26459 cggtggccac agggcggtac cgcggcgcgc gggaggtgct ccaggcggcg cagtgctccc 26519 cccgtgagaa cgagcggttt agcatcatgg ccatcgtcgc gcaccgtcac cgccctgggc 26579 ttctctagcc gaccacgctc cacctcaagc aacccgggga gtagtctttg ctgccaccgc 26639 ggccttctcc tcactgctcg gttcaaggat gagagagaga tcgaggtgga aggaggtaga 26699 agagatgagg tgagaatata tggatcactg acaagtgggc ctgttatttt ttgccgcgtt 26759 agaaatgcca agtcagctaa cctagcctaa aaccgtccaa aatagtgccc cggtattcgt 26819 ctggttttaa gagttttgag gtattgaata caatatatgt tattatagtt tagagggtaa 26879 attgtactac cgtaccataa tagttcgggg gtaaattgta cttcctctgt actcataatg 26939 gaagtcgttt aggacaatat ttaagtcaaa cattgggaat ataaatcatg aataactctc 26999 aagttgttga gtttgaaaat gtaaaaatta tatgaataga tttttcttga aaaatatttt 27059 cataaaagta tacatatatc actttttaat atatattttt atagaaacaa gaagtcaaaa 27119 ttatgttttg gagaccgtgt cgctgtccaa aacgagtacg gagggaatac tttttactcg 27179 tagtttacaa tatcgatctg ttaactgttt ataagagtat ttggatccat gcagtattgt 27239 agtagtagta gcagtacatt tgagaatatt agagtacgaa ttaggtggtg tttggataca 27299 gagacttaac tttagtcttt gtatttagac actaatttag aatattaaat atagactact 27359 tacaaaacta attatataaa tgaaagctaa tttgcgagac aaatttttta agcctaatta 27419 atctataatt agagaatgtt tattgtagca tcatataggc taattatgga ttaattaggc 27479 tcaaaagatt tgtctcgtga attagtctaa gattatgaat gagttttatt aatagtctac 27539 gtttaatatt tataattaat tttcaaacat ctgatgtaat agggacttaa aagactttta 27599 actaccattt aaacagggtc actccaatgg taggtgaaat tcaacagctg ggaaatgcac 27659 tagtgcgttg tgtcagtaaa tttcgtacta gtaccacgag acagctagac agacacgtca 27719 ggtcacgacg cagcactgca gcagggctgt agcctgtacg ggaggcgtag gcgcaacatc 27779 tcgaaaattt tgttccgtag ctaaagcccc cccaaagcca gccgcggttt tcatggattg 27839 cacaggcgtt cctctccgcc ggattccgga aagaaaaaag aaaaaacaag atgtccgttc 27899 cctgggtggt gcatccgttt tctgacaggt gcatgcacct ctcgctcgct accgcggtag 27959 cgcccacacg aaccacgttg gctttcggcc aacttgcccg attctttaat cccctcacga 28019 cgtacgtcgc tgtccaataa aaagttttaa caccaactat agtaaccagc ttaattttaa 28079 taaaaccaaa gaaaattctt aattacttag agcatctcca acagggtcct caaacaaagt 28139 ccctaaataa gttttgagag ttgatgcaaa aaaatatagg tccagcagat tccctactag 28199 agcccccaat ctagggaggc ccctagatca ctcctccaag cccccagtcc ggggggctca 28259 accccacagc ccccatcctc tttttttttg gcgggggaaa tttctgagcg cgcgccatcg 28319 tcatcctccc tcccgcgtca tcgccatcct cccacaacct cccagcgagc aagccgccag 28379 gtcgttttgc tccggtcggc gaccacccga catccctcgg gaagaaactt cggcgagatc 28439 ccatggtgtg ccgcctcaag ccatcacaat catcggtttt actccgtcac tcactgtcgc 28499 cgacctcctt gcatcttcgg ccaacgcacg tcgtcaccga ttgatcatgc ccaggtgagt 28559 ctcgacgtcc tccccgaact agctgcccac ccgctgtcca caaggcctag cgccgaccat 28619 cgacagctat ctatcaccgg ccgcacacat gctgcagtaa aaaaattcag agatgatggt 28679 gattaaaaca aatcacaata gtaaagttca gtttcgtatg tctgagtctc cttgtttgat 28739 tttgatcttt atgggcttac taggcgtcta ggcccatcta aatcattcgc acagcaaaac 28799 gtacattgtc atcattcatg ttttatatgc agtgtcttgt tctatgtcag agagctaatc 28859 ttgcagagca tatataatat ttaagaaata aatttgtgtt tgcactgagt ccttagtact 28919 gcgcaaccaa tatatatgct aaataaatac atattgcaaa cagtataacc tgatgtacat 28979 tgcaatcact tgttgatgtt tctgagatag attggaaagg ttgtcaattt atatatttat 29039 tgcagtgact agattgcaat gacaagtgga ggtgattcct ttgtgcgcat gatgtccgag 29099 gacactgatg tcgaagtgct aatgccaaat gaagaccttc gtacttcaac aaatggtgca 29159 aaaggaagtg ccaaaagatc aagcaactat actcataagg aggacattca attgtgcatt 29219 tcatggcaga gcattagctc agatcctatt attggcaatg agcaaccagg gaaggcatat 29279 tggcagagga tcgcagagca ctaccatgct aaccgtgatt ttgagtctga taggaatgca 29339 aactctcttg agcaccattg gggtaacatt cagaaggaag taagcaagtt tcaaggttgc 29399 tacaatcaaa ttgagcgtcg tcatccaagt ggcataccac atcaagagct tgtaagttaa 29459 attgtttatt tattattatt aataacaatc ttgtatgtat gtgaattaaa acttaaatta 29519 tgttgcaggt tcttgaagct gaggcattat actcgtccac tgcaccaaag aatagggcat 29579 ttcagtttaa tcattgttgg ctcaagttga ggaattctcc aaagtttcaa acactagaat 29639 cccacaagag gccacggtct aggaagtctt cgaccccaat tgagagagct ggtgaagaag 29699 atgaaggaga tgatgctagc aagagtacag ctcctgattt atctcagccg agtgctaaaa 29759 agagaccaat aggtaggaag caagcaaagg aaaagttgaa gaatggagga caagatggac 29819 catacaaaga ggcgatgaaa gatttgcttg acgctaaaga gaaagaagcg aaattgaaag 29879 aagagagatg gaaggaaact aaggagattc aagagcgcaa gctcttattt gctgagcgta 29939 agttagtgtg ggatcaagaa cagaagatta tgttttgtga tgtttccacc ttggaaccgg 29999 atgtgagaac gtatgtgttg gctatgaggg cacagattgc agcttcaaag gtggctgccc 30059 tcaatggtgg atttgatggt agtagtggct ttggaggtga gtttggtggc ggtaatggag 30119 aagtttgagc acttcgatgg aataagttgg attctattgg atgatccatg tgtcctttac 30179 tagtaggata tgccattatc acgattggtc tttggagtcc ttttttgtta attatttcca 30239 caataatttt agtgtcactt gctagtagga catatattac tttcagattt gttatttata 30299 atcgaatcat tcatggttgt aggatgtatt atttttaaat tatataatgc atcattgggt 30359 tcacatagtg tattttttat gagcaatttt cattttcatt ggtgaattac gaatcttggt 30419 tgcatcttgt tgtcgtatat ggcactgtac ccataccata tttacatgtt taaaaatttt 30479 aattttgtat tcgaattgta gtgtttgaaa ttgtgaattt aagtatggtt aaattatgtg 30539 agttagaaat aattgtgttc gaatttttgt ggtgttaaac atactgtata tggattgtat 30599 tttaaaatac aagataaaca tgagtaggga ctaagaaata ggggctactg ctggagttgg 30659 aggcattttt tagtccttga gaaatggggg cagccctcat ttaactttta gacgcttcaa 30719 aataaggtct attgctggag atgctcttag gtccccatcg tttccttcaa tcagcattag 30779 ccgctaccaa aatttgaaat tttaaagttt ttcatcgaag tttattttcc agcattggta 30839 tttaagtcgc taaaaacaca tatatgaaag tcttatctgt aaattattat tattttgcta 30899 atacgccgaa tggcgtatta tatgtatttg gccaaaggat gggggcctta aaccttagcc 30959 ttagtcgtgc cctacaaaag acacacgcct cgtcagggca agggtactcg agcgtggagg 31019 catggttcgc aagccatggt cggcgaggcc atgctctagc aatgcggtgc agtccacctc 31079 ctctccgagc gcggagctcc aacgggtgat ggccaatgaa agaaggagac cgacttgccg 31139 ttggttgtag catgtaaatt tcttgcactt tcttaataaa tttcggctag tgttcgctag 31199 ctcgaccaaa aaaaagagag gctaatgatg gggttaggaa gtgaaaacaa gcgcagtgga 31259 ggagaagaag atcgagaggc ctatttgtat gatgctttgt cgatgtagat ttagtcccat 31319 gctcatctca tccctcagcc acaacaatcc catcattgta gagctcatca gcttgctcta 31379 ccatctctcc ttgtttatgg gccactccca acctgctacc catcgcctga tctatgaatc 31439 tagctgtcaa tgacctcatt ggccctagtc ttgagatcac ccagtggatc ctttggcaaa 31499 gtggatccgc ctttgttttg ctttggagaa agaaaacgat gacttagcta aagatctcgt 31559 cggtcaaaaa gagagatgcc tttgatatat gctgaaaaat agaggagagg cagtgtcagc 31619 tggagagctc tttatccaca cccgtgggga tcgagcttat tggcgtagag ggagagacat 31679 tgagggagag agagtgcaag gggatttttt tgtaatttct agatttggtg gtgtttagtg 31739 caatactttg aacttatttg taaattaagt aaaacatgat tgtaatagaa aatatcataa 31799 actgacatag aaaaacaaag ataacaattg aagccactag cgctatggag aaaatgtgtg 31859 acctcggtct acatataacg gctatgtgtt attaccatgt cacttctaaa actaccatat 31919 aaccatatac gtttttctcc tacttatcaa aaatataatt aacaaatttt tttaccggtt 31979 tagtttacaa gaaaaaagtt tgactgcatt gttgataccc taccatcctt gtacgaaggc 32039 aggcgctaca caacaccgct gccgctgccg tcgccgccgt aagctaaggc tgtcacgccg 32099 gcgaccggcc acggccgacg tggaaagcga cctaatctgt aaagtgtaaa cccaccctat 32159 agaaaaaccc ggttggtggg acgagaatca ccgaatcagc gtcgacgacg acggccgacg 32219 actccagcag cgggggtcac gagactcgga gccgagagag agaaagagga ccacgcgcgc 32279 attcactcaa ctgcataaaa aaacccccgc gcggcggctg cgcagtcacg tctacgctcg 32339 cgggatcgct cgatgaaatc aaccaaaatc ttaaacaaac cgaaccaacc aaccaaccgt 32399 cgcgcgtgtg cgcgcgaggc gctcgattag cggagacgca aacccatgta acaccgtgcg 32459 gaaaaactta aagaaatccg cgtcgctcgc gccgtcgcgc gcgcgggggg cgcgtagtac 32519 ctccacacac gattctgcac ttgtactacc acgcgaacct gatgcggttt accggtcatc 32579 gattggctgc gaggcttgct gttactggtg gtggtagact ggtagtacgt tgcttgtact 32639 acctcactca tgtctggaga ttactacact tcgatctttt cctctgtttt gttaattgag 32699 atttggaggt gttactgttc gctgtgtggt taagtatatt ggtgtataac tacaagttgg 32759 tactctcaaa gggaaaaaaa ggtactgcaa attggctaat ctatgattct attctgcaca 32819 tgcatataga taagcactat aataaggaac tgaggatcgt gaaaagtggc attaattata 32879 acaggaccat gtacgactat accactggca gggatttcac ggaatcaact ataggagtag 32939 gttagttggc acttggcaag gttgattgat tcactaacgt ggggaaaaga acacacgaga 32999 tcaaaggctg tcgtgggctt aaaataaaag ggcccatctg ggatcagctc ttttaagccc 33059 acatcactag ccaggaggct aggagtccag tattgcctcg tactgggccg tcctctgaaa 33119 tttggaggcc ctgtctaaaa ttctaatcaa gccttaaact taagtgacaa aataaaaaga 33179 ggtagactat ataacagcat accattacaa cggaatagct gtcgttagca cgatactcta 33239 tatgcatcag atatggtacc aggtactata ccgacgttag catgatccga taggtatagg 33299 atctggtgta cctagatatt atgctaacat aatcatgaca tcagctattc cattggaatg 33359 atataccggt ggtatcttcg gtaaattgtg agcatgctag gaatttaagt aaagggcctt 33419 agggttaaaa tcacacgttc ttagtcactg cactatcaag tgcatttcaa ccctaatgcc 33479 cttttatgat ctatatctgc cctcctagcc tattttggac gaggctccct cgtcctagaa 33539 gtaaatcatc gtatccataa tccaaccgat tagtagagaa aaaacatact tttcgaacgc 33599 aacagttctt gtcatcttgt gctctcaaat gttcattttc cccttactta aaggacatgg 33659 aaaacagaac agaccc 33675 <210> SEQ ID NO 3 <211> LENGTH: 1119 <212> TYPE: DNA <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (1)..(1119) <400> SEQUENCE: 3 atg cat aac cag gct cca att caa cgt aga aaa tca aca cgt att tac 48 Met His Asn Gln Ala Pro Ile Gln Arg Arg Lys Ser Thr Arg Ile Tyr 1 5 10 15 gtt ggg aat gtg ccg att ggc gat ggt gct ccc atc gcc gta cag tcc 96 Val Gly Asn Val Pro Ile Gly Asp Gly Ala Pro Ile Ala Val Gln Ser 20 25 30 atg acc aat acg cgt acg aca gac gtc gaa gca acg gtc aat caa atc 144 Met Thr Asn Thr Arg Thr Thr Asp Val Glu Ala Thr Val Asn Gln Ile 35 40 45 aag gcg ctg gaa cgc gtt ggc gct gat atc gtc cgt gta tcc gta ccg 192 Lys Ala Leu Glu Arg Val Gly Ala Asp Ile Val Arg Val Ser Val Pro 50 55 60 acg atg gac gcg gca gaa gcg ttc aaa ctc atc aaa cag cag gtt aac 240 Thr Met Asp Ala Ala Glu Ala Phe Lys Leu Ile Lys Gln Gln Val Asn 65 70 75 80 gtg ccg ctg gtg gct gac atc cac ttc gac tat cgc att gcg ctg aaa 288 Val Pro Leu Val Ala Asp Ile His Phe Asp Tyr Arg Ile Ala Leu Lys 85 90 95 gta gcg gaa tac ggc gtc gat tgt ctg cgt att aac cct ggc aat atc 336 Val Ala Glu Tyr Gly Val Asp Cys Leu Arg Ile Asn Pro Gly Asn Ile 100 105 110 ggt aat gaa gag cgt att cgc atg gtg gtt gac tgt gcg cgc gat aaa 384 Gly Asn Glu Glu Arg Ile Arg Met Val Val Asp Cys Ala Arg Asp Lys 115 120 125 aac att ccg atc cgt att ggc gtt aac gcc gga tcg ctg gaa aaa gat 432 Asn Ile Pro Ile Arg Ile Gly Val Asn Ala Gly Ser Leu Glu Lys Asp 130 135 140 ctg caa gaa aag tat ggc gaa ccg acg ccg cag gcg ttg ctg gaa tct 480 Leu Gln Glu Lys Tyr Gly Glu Pro Thr Pro Gln Ala Leu Leu Glu Ser 145 150 155 160 gcc atg cgt cat gtt gat cat ctc gat cgc ctg aac ttc gat cag ttc 528 Ala Met Arg His Val Asp His Leu Asp Arg Leu Asn Phe Asp Gln Phe 165 170 175 aaa gtc agc gtg aaa gcg tct gac gtc ttc ctc gct gtt gag tct tat 576 Lys Val Ser Val Lys Ala Ser Asp Val Phe Leu Ala Val Glu Ser Tyr 180 185 190 cgt ttg ctg gca aaa cag atc gat cag ccg ttg cat ctg ggg atc acc 624 Arg Leu Leu Ala Lys Gln Ile Asp Gln Pro Leu His Leu Gly Ile Thr 195 200 205 gaa gcc ggt ggt gcg cgc agc ggg gca gta aaa tcc gcc att ggt tta 672 Glu Ala Gly Gly Ala Arg Ser Gly Ala Val Lys Ser Ala Ile Gly Leu 210 215 220 ggt ctg ctg ctg tct gaa ggc atc ggc gac acg ctg cgc gta tcg ctg 720 Gly Leu Leu Leu Ser Glu Gly Ile Gly Asp Thr Leu Arg Val Ser Leu 225 230 235 240 gcg gcc gat ccg gtc gaa gag atc aaa gtc ggt ttc gat att ttg aaa 768 Ala Ala Asp Pro Val Glu Glu Ile Lys Val Gly Phe Asp Ile Leu Lys 245 250 255 tcg ctg cgt atc cgt tcg cga ggg atc aac ttc atc gcc tgc ccg acc 816 Ser Leu Arg Ile Arg Ser Arg Gly Ile Asn Phe Ile Ala Cys Pro Thr 260 265 270 tgt tcg cgt cag gaa ttt gat gtt atc ggt acg gtt aac gcg ctg gag 864 Cys Ser Arg Gln Glu Phe Asp Val Ile Gly Thr Val Asn Ala Leu Glu 275 280 285 caa cgc ctg gaa gat atc atc act ccg atg gac gtt tcg att atc ggc 912 Gln Arg Leu Glu Asp Ile Ile Thr Pro Met Asp Val Ser Ile Ile Gly 290 295 300 tgc gtg gtg aat ggc cca ggt gag gcg ctg gtt tct aca ctc ggc gtc 960 Cys Val Val Asn Gly Pro Gly Glu Ala Leu Val Ser Thr Leu Gly Val 305 310 315 320 acc ggc ggc aac aag aaa agc ggc ctc tat gaa gat ggc gtg cgc aaa 1008 Thr Gly Gly Asn Lys Lys Ser Gly Leu Tyr Glu Asp Gly Val Arg Lys 325 330 335 gac cgt ctg gac aac aac gat atg atc gac cag ctg gaa gca cgc att 1056 Asp Arg Leu Asp Asn Asn Asp Met Ile Asp Gln Leu Glu Ala Arg Ile 340 345 350 cgt gcg aaa gcc agt cag ctg gac gaa gcg cgt cga att gac gtt cag 1104 Arg Ala Lys Ala Ser Gln Leu Asp Glu Ala Arg Arg Ile Asp Val Gln 355 360 365 cag gtt gaa aaa taa 1119 Gln Val Glu Lys 370 <210> SEQ ID NO 4 <211> LENGTH: 686 <212> TYPE: PRT <213> ORGANISM: Oryza sativa <400> SEQUENCE: 4 Met Ala Thr Gly Val Ala Pro Ala Pro Leu Pro His Val Arg Val Arg 1 5 10 15 Asp Gly Gly Ile Gly Phe Thr Arg Ser Val Asp Phe Ala Lys Ile Leu 20 25 30 Ser Val Pro Ala Thr Leu Arg Val Gly Ser Ser Arg Gly Arg Val Leu 35 40 45 Val Ala Lys Ser Ser Ser Thr Gly Ser Asp Thr Met Glu Leu Glu Pro 50 55 60 Ser Ser Glu Gly Ser Pro Leu Leu Gly Ile Thr Arg Arg Leu Leu Phe 65 70 75 80 Thr Leu His Met Val Gly Asn Val Pro Leu Gly Ser Asp His Pro Ile 85 90 95 Arg Ile Gln Thr Met Thr Thr Ser Asp Thr Lys Asp Val Ala Lys Thr 100 105 110 Val Glu Glu Val Met Arg Ile Ala Asp Lys Gly Ala Asp Phe Val Arg 115 120 125 Ile Thr Val Gln Gly Arg Lys Glu Ala Asp Ala Cys Phe Glu Ile Lys 130 135 140 Asn Thr Leu Val Gln Lys Asn Tyr Asn Ile Pro Leu Val Ala Asp Ile 145 150 155 160 His Phe Ala Pro Thr Val Ala Leu Arg Val Ala Glu Cys Phe Asp Lys 165 170 175 Ile Arg Val Asn Pro Gly Asn Phe Ala Asp Arg Arg Ala Gln Phe Glu 180 185 190 Gln Leu Glu Tyr Thr Glu Asp Asp Tyr Gln Lys Glu Leu Glu His Ile 195 200 205 Glu Lys Val Pro Asn Ile Ser Leu Phe Ser Val Asn Leu Val Phe Ser 210 215 220 Pro Leu Val Glu Lys Cys Lys Gln Tyr Gly Arg Ala Met Arg Ile Gly 225 230 235 240 Thr Asn His Gly Ser Leu Ser Asp Arg Ile Met Ser Tyr Tyr Gly Asp 245 250 255 Ser Pro Arg Gly Met Val Glu Ser Ala Leu Glu Phe Ala Arg Ile Cys 260 265 270 Arg Lys Leu Asp Phe His Asn Phe Val Phe Ser Met Lys Ala Ser Asn 275 280 285 Pro Val Ile Met Val Gln Ala Tyr Arg Leu Leu Val Ala Glu Met Tyr 290 295 300 Asn Leu Gly Trp Asp Tyr Pro Leu His Leu Gly Val Thr Glu Ala Gly 305 310 315 320 Glu Gly Glu Asp Gly Arg Met Lys Ser Ala Ile Gly Ile Gly Thr Leu 325 330 335 Leu Met Asp Gly Leu Gly Asp Thr Ile Arg Val Ser Leu Thr Glu Pro 340 345 350 Pro Glu Glu Glu Ile Asp Pro Cys Arg Arg Leu Ala Asn Leu Gly Thr 355 360 365 His Ala Ala Asp Leu Gln Ile Gly Val Ala Pro Phe Glu Glu Lys His 370 375 380 Arg Arg Tyr Phe Asp Phe Gln Arg Arg Ser Gly Gln Leu Pro Leu Gln 385 390 395 400 Lys Glu Ala Pro Glu Leu Leu Tyr Arg Ser Leu Ala Ala Lys Leu Val 405 410 415 Val Gly Met Pro Phe Lys Asp Leu Ala Thr Val Asp Ser Ile Leu Leu 420 425 430 Lys Glu Leu Pro Pro Val Glu Asp Ala Gln Ala Arg Leu Ala Leu Lys 435 440 445 Arg Leu Val Asp Ile Ser Met Gly Val Leu Thr Pro Leu Ser Glu Gln 450 455 460 Leu Thr Lys Pro Leu Pro His Ala Ile Ala Leu Val Asn Val Asp Glu 465 470 475 480 Leu Ser Ser Gly Ala His Lys Leu Leu Pro Glu Gly Thr Arg Leu Ala 485 490 495 Val Thr Leu Arg Gly Asp Glu Ser Tyr Glu Gln Leu Asp Leu Leu Lys 500 505 510 Gly Val Asp Asp Ile Thr Met Leu Leu His Ser Val Pro Tyr Gly Glu 515 520 525 Glu Lys Thr Gly Arg Val His Ala Ala Arg Arg Leu Phe Glu Tyr Leu 530 535 540 Glu Thr Asn Gly Leu Asn Phe Pro Val Ile His His Ile Glu Phe Pro 545 550 555 560 Lys Ser Val Asn Arg Asp Asp Leu Val Ile Gly Ala Gly Ala Asn Val 565 570 575 Gly Ala Leu Leu Val Asp Gly Leu Gly Asp Gly Val Leu Leu Glu Ala 580 585 590 Ala Asp Gln Glu Phe Glu Phe Leu Arg Asp Thr Ser Phe Asn Leu Leu 595 600 605 Gln Gly Cys Arg Met Arg Asn Thr Lys Thr Ile Ala Ile Met Gly Cys 610 615 620 Ile Val Asn Gly Pro Gly Glu Met Ala Asp Ala Asp Phe Gly Tyr Val 625 630 635 640 Gly Gly Ala Pro Gly Lys Ile Asp Leu Tyr Val Gly Lys Thr Val Val 645 650 655 Gln Arg Gly Ile Ala Met Glu Gly Ala Thr Asp Ala Leu Ile Gln Leu 660 665 670 Ile Lys Asp His Gly Arg Trp Val Asp Pro Pro Val Glu Glu 675 680 685 <210> SEQ ID NO 5 <211> LENGTH: 594 <212> TYPE: DNA <213> ORGANISM: Arabidopsis thaliana <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (1..594) <223> OTHER INFORMATION: unsure at all n locations <400> SEQUENCE: 5 aaaatcgtca atccctctca aactcttctc accactaatt tcttcctctg gaacattctc 60 ttctctatta ttttgattcc cttggcctca acactggttt ctcaattgca tgatcttggc 120 tcgtcttcag ttactttgat tcactgagaa aaatggcgac tggagtattg ccagctccgg 180 tttctgggat caagataccg gattcgaaag tcgggtttgg taaaagcatg aatcttgtga 240 gaatttgtna tgttaggagt ctaagatctg ctaggagaag agtttcggtt atccggaatt 300 caaaccaagg ctctgattta gctgagcttc aaccctgcat ccgaaggaaa gcccctcttc 360 ttagtgccaa ggcaggaaat attgtgaatc attgcataan gcggttagga ggaagnctcg 420 gacctgtaat ggttgaaatg tcgncccttn gaagngnaca ccggtanggg tcaaacggtg 480 ccttcttngg gtacaaaang tnttccttgg ancctnttng tgggggtttt gggattgcgg 540 aaaaaggggc tgnttttnaa gggnacctnn caaggnagna agggngggtc tttt 594 <210> SEQ ID NO 6 <211> LENGTH: 615 <212> TYPE: DNA <213> ORGANISM: Glycine max <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (1..615) <223> OTHER INFORMATION: unsure at all n locations <400> SEQUENCE: 6 accagaagtg atgagcctta tgaagaactg gacattctta agggtgttga tgctactatg 60 cttttccatg accttcctta tacagaagac agaattagca gagtgcatgc aaccagacgg 120 ttatttgagt acctatctga caattctcta aacttccctg ttattcacca tattcagttc 180 ccaaatggga ttcacaggga tgacttggta attggtgctg gttctgatgc tggagccctt 240 ctggttgatg ggcttggaga tggactactt ttggaagccc cggacaagga ttttgaattt 300 attagaaaca cttctttcaa tttgttgcaa ggctgcagaa tgagaaatac aaagacagag 360 tatgtctcat gtccatcctg tggcagaaca ttgtttgatc ttcaagaagt aagtgcacaa 420 attcgggaga agacatcaca cctncctggt gtttcgattg caatcatggg atgcattgtt 480 aatggaccag gggagatggc tgatgcagac tttgggtatg tgggaagcac tccccggaag 540 attgacctct atgttgggaa gactggtgtg aagcgtggga attcaatgga gcatgccaac 600 catggcttga tccga 615 <210> SEQ ID NO 7 <211> LENGTH: 589 <212> TYPE: DNA <213> ORGANISM: Lycopersicon esculentum <400> SEQUENCE: 7 tggcgatgaa tcacatgatg agttggaaat cctgaagagc tctgatgtta caatgattct 60 tcataatctg ccatatacag aggaaaaaat tggcagggtt caagcagcca ggaggctttt 120 tgagtatctt tccgagaatt ccttgaactt tccagtgatt catcacatac aatttcccag 180 caacacccac agagatgact tagtgattgg tgccgggaca aatgcgggag ccctcttggt 240 agatgggctt ggtgatggac ttctcttgga agctccagac aaggattttg attttctcag 300 aaatacatct ttcaatttgc ttcaaggttg cagaatgcgg aacacaaaaa cggaatatgt 360 atcatgccca tcctgtggca gaactttatt cgatcttcaa gagataagcg ctcaaattag 420 agagaagacg tcacacttgc ctggtgtttc aattgccatc atgggttgca ttgtgaatgg 480 acctggggag atggctgatg ctgactttgg atatgttggt ggtgctcctg gaaagattga 540 cctttacgtc ggcaagacag tggtgaaacg ccctattgaa atggagcat 589 <210> SEQ ID NO 8 <211> LENGTH: 617 <212> TYPE: DNA <213> ORGANISM: Mesembryanthemum crystallinum <400> SEQUENCE: 8 gaaaagcata gacattattt tgactttcaa cgtagaactg gtcaattacc gattcagaaa 60 gagggtgaag atgtggacta tagaggtgtc ctacaccgtg atggttctgt cctcatgact 120 gtttccttgg acatgttgaa gacacctgaa ctcctttaca agtcattagc agcaaagctt 180 gttgttggca tgccatttaa ggatctggct actgtagact ctatttttct gagagagctt 240 tcaccagtag atgactctga tgctcggcta gctctgaaga ggttaataga tataagtatg 300 ggtgtcatag ctcctttttc tgagcaactg acaaagccct tgccaaatgc aattgtattg 360 gtgaacctta aagagttgtc aaccggtgca tacaagcttt taccagtagg aacccgcttg 420 gcagtatctg tgcgaggtga tgaaccatat tgagacattg gagatcctta aagatattga 480 tgcttcaatg gctttttatg aactgtcttt taccgagagg atattcacac agtgcatgct 540 ggaccaaagc ttttgaggtc ctatcagata agcttggacc tcccgtaatt aacatatcct 600 atcccttcgg attaagg 617 <210> SEQ ID NO 9 <211> LENGTH: 416 <212> TYPE: DNA <213> ORGANISM: Oryza sativa <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (1..416) <223> OTHER INFORMATION: unsure at all n locations <400> SEQUENCE: 9 ggattcggca cgagtctaat tgatggtctt ggtgatggtg tacttcttga aagctgctga 60 ccaagaaatt tgagtttttg agggacacat cctccaactt gttacagggc tgcaggatgc 120 gcaacacaaa aacggaatat ttccctggtc ctcctggtgg gcggacacnc tttnaccncc 180 aaaaattcan tgctcaaatt aaanaaaaaa ccnctcatct gccaggcntc tctattgcta 240 tcatgggtng cattgtcaat gggccagggg aaatggccaa tcctaattnc ggatacttng 300 gaggtgccct ggagaaaatc nacctntatn ttggttnttt tttttnnaac ggggcatngc 360 aanagaaggg ggcccnnacc ccnanatncn ttcnccgggn ccngggccgn ggggtt 416 <210> SEQ ID NO 10 <211> LENGTH: 621 <212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 10 gaattcggca ccagaagcca ctcccacatg caattgtact tgtcaacctc gacgaattgt 60 caagtggtgc acacaaactt ttgccagaag gcactagact agctgtcact cttcgtggtg 120 atgaatcata cgagcagcta gatattctta aggatgttga tgatataaca atgttgttac 180 ataatgttcc atatggtgag gagaagacag gcagggtgca tgctgctagg aggttatttg 240 agtacttaca ggccaatggc ttgaacttcc ctgtaattca tcacataaat ttccctgaaa 300 ccattgacag agatggtctt gtcattggtg ctggggccaa cgttggtgct ctcttagtcg 360 atggtcttgg tgatggtgta ttccttgaag ctgctgacca ggaatttgag tttctgaggg 420 acacatcttt caacttgctc caaggttgca ggatgcgcaa cacaaaaact gaatatgtgt 480 cttgtccttc ctgcggccga acactctttg accttcagga aatcagcgct gagattagag 540 aaaagacctc tcatctgcca ggtgtctcga tcgctatcat gggctgtatt gcaatggacc 600 aggagagatg gctgatgccg a 621 <210> SEQ ID NO 11 <211> LENGTH: 601 <212> TYPE: DNA <213> ORGANISM: Pinus taeda <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (1..601) <223> OTHER INFORMATION: unsure at all n locations <400> SEQUENCE: 11 aatgcaagaa gtacggaagg gcaatgcgaa ttggcacaaa ccatggaagt ctttccgatc 60 gtactatgag ttattatggt gattctccca ggggtatggt ggaatcagca tttgaatttg 120 cacgcatttg ccggaagttg ggttttcata attttgtgtt ttcaatgaaa gcgagcgatc 180 ctgtagtcat ggttcaggca taccgtttac ttgttgcgga gatgtatgtg caaggatggg 240 attatccatt gcatttagga gttactgaag ctggtgaagg tgaagatgga cgcatgaagt 300 ctgcaattgg cattggaaca cttttgcagg atggtttggg tgatactatt cgagtttccc 360 ttacagaacc tccagaagag gagatcaatc cctgtagaag acttgcaaat cttgggatgc 420 aagctgcaaa gctanggaaa ggagtggctc cttttgagga gaacatcgtc attactttac 480 tttccaacgc angactggcn agctccagta cagaaggagg gtgatgaggt ggatacagag 540 gagtccgcat cgtgatggtc tgttctaatg tcagtgtcct tgacagntga agacacanaa 600 a 601 <210> SEQ ID NO 12 <211> LENGTH: 443 <212> TYPE: DNA <213> ORGANISM: Physcomitrella patens <400> SEQUENCE: 12 gcacgtatct gccgcaaaca tgactatatt aatttcttgt tttctatgaa agcaagcaat 60 ccggtcgtaa tggttcaagc atatcggctt ttagtatctg agatgtatgt gaacaactgg 120 gactacccat tacatcttgg tgttactgag gctggagagg gagaggatgg tcgcatgaag 180 tcagctatcg gcattggtgc tttacttcag gatggtctcg gtgacaccat acgtgtttca 240 ttgacggaag ctcctgaaga agaaattgat ccttgcacaa agcttgcaaa ccttggcatg 300 aagatttctg cagaacagaa gggggtggct gaattcgaag agaagcaccg gcgatacttt 360 gacttccaac gaaggaccgg ccaacttcca ctgcagaggg agggagagtt ggtggactac 420 agaaacgttc tgcaccgtga tgg 443 <210> SEQ ID NO 13 <211> LENGTH: 938 <212> TYPE: DNA <213> ORGANISM: Arabidopsis thaliana <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (1..938) <223> OTHER INFORMATION: unsure at all n locations <400> SEQUENCE: 13 atgatactgc cagctannnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 60 nnnnnnnnnn nnnnnnnnnn nnnccacgcg tccgaaaacg ttttatcctg agtttctttc 120 accatccagc ttcatttgtg aaaaatcgtc aatccctctc aaactcttct caccactaat 180 ttcttcctct ggaacattct cttctctatt attttgattc ccttggcctc aacactggtt 240 tctcaattgc atgatcttgg ctcgtcttca gttactttga ttcactgaga aaaatggcga 300 ctggagtatt gccagctccg gtttctggga tcaagatacc ggattcgaaa gtcgggtttg 360 gtaaaagcat gaatcttgtg agaatttgtg atgttaggag tctaagatct gctaggagaa 420 gagtttcggt tatccggaat tcaaaccaag gctctgattt agctgagctt caacctgcat 480 ccgaaggaag ccctctctta gtgccaagac agaaatattg tgaatcattg cataagacgg 540 tgagaaggaa gactcgtact gttatggttg gaaatgtcgc ccttggaagc gaacatccga 600 taaggattca aacgatgact acttcggata caaaagatat tactggaact gttgatgagg 660 ttatgagaat agcggataaa ggagctgata ttgtaaggat aactgtccaa gggaagaaag 720 aggcggatgc gtgctttgaa ataaaagata aactcgttca gcttaattac aatataccgc 780 tggttgcaga tattcattgt gcccctactg tagccttacg agtcgctgaa tgctttgaca 840 agatccgtgt caacccagga aattttgcgg acaggcgggc ccagtttgag acgattgatt 900 atacagaaga tgaatatcag aaagaactcc agcatatc 938 <210> SEQ ID NO 14 <211> LENGTH: 432 <212> TYPE: DNA <213> ORGANISM: Arabidopsis thaliana <400> SEQUENCE: 14 agcataacaa ggctctgatt tagctgagct tcaacctgca tccgaaggaa gccctctctt 60 agtgccaaga cagaaatatt gtgaatcatt gcataagacg gtgagaagga agactcgtac 120 tgttatggtt ggaaatgtcg cccttggaag cgaacatccg ataaggattc aaacgatgac 180 tacttcggat acaaaagata ttactggaac tgttgatgag gttatgagaa tagcggataa 240 aggagctgat attgtaagga taactgttca agggaagaaa gaggcggatg cgtgctttga 300 aataaaagat aaactcgttc agcttaatta caatataccg ctggttgcag atattcattt 360 tgcccctact gtagccttac gagtcgctga atgctttgac aagatccgtg tcaacccaag 420 aaattttgcg ga 432 <210> SEQ ID NO 15 <211> LENGTH: 528 <212> TYPE: DNA <213> ORGANISM: Arabidopsis thaliana <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (1..528) <223> OTHER INFORMATION: unsure at all n locations <400> SEQUENCE: 15 tgatacgcca gctctatacg actcactatt agggaagctg gtacgcctgc aggtacccgg 60 tccgggaatt cccngggtcg acccacgcgt ccgaaagaac tccagcatat cgagcaggtc 120 ttcactcctt tggttgagaa atgcaaaaag tacgggagag caatgcgtat tgggacaaat 180 catggaagtc tttctgaccg tatcatgagc tattacgggg attctccccg aggaatggtt 240 gaatctgcgt ttgagtttgc aagaatatgt cggaaattag actatcacaa ctttgttttc 300 tcaatgaaag cgagcaaccc agtgatcatg gtccaggcgt accgtttact tgtggctgag 360 atgtatgttc atggatggga ttatcctttg catttgggag ttactgaggc aggagaaggc 420 gaagatggac ggatgaaatc tgcgattgga attgggacgc ttcttcagga cgggctcggt 480 gacacaataa gagtttcact gacggagcca ccagaagagg agatagat 528 <210> SEQ ID NO 16 <211> LENGTH: 379 <212> TYPE: DNA <213> ORGANISM: Arabidopsis thaliana <400> SEQUENCE: 16 gcgtattggg acaaatcatg gaagtctttc tgaccgtatc atgagctatt acggggattc 60 tccccgagga atggttgaat ctgcgtttga gtttgcaaga atatgtcgga aattagacta 120 tcacaacttt gttttctcaa tgaaagcgag caacccagtg atcatggtcc aggcgtaccg 180 tttacttgtg gctgagatgt atgttcatgg atgggattat cctttgcatt tgggagttac 240 tgaggcagga gaaggcgaag atggacggat gaaatctgcg attggaattg ggacgcttct 300 tcaggacggg ctcggtgaca caataagagt ttcactgacg gagccaccag aagaggagat 360 agatccctgc aagcgattg 379 <210> SEQ ID NO 17 <211> LENGTH: 395 <212> TYPE: DNA <213> ORGANISM: Arabidopsis thaliana <400> SEQUENCE: 17 aaagaactcc agcatatcga gcaggtcttc actcctttgg ttgagaaatg caaaaagtac 60 gggagagcaa tgcgtattgg gacaaatcat ggaagtcttt ctgaccgtat catgagctat 120 tacggggatt ctccccgagg aatggttgaa tctgcgtttg agtttgcaag aatatgtcgg 180 aaattagact atcacaactt tgttttctca atgaaagcga gcaacccagt gatcatggtc 240 caggcgtacc gtttacttgt ggctgagatg tatgttcatg gatgggatta tcctttgcat 300 ttgggagtta ctgaggcagg agaaggcgaa gatggacgga tgaaatctgc gattggaatt 360 ggggacactt cttcaggacg ggctcggtga cacaa 395 <210> SEQ ID NO 18 <211> LENGTH: 395 <212> TYPE: DNA <213> ORGANISM: Arabidopsis thaliana <400> SEQUENCE: 18 aaagaactcc agcatatcga gcaggtcttc actcctttgg ttgagaaatg caaaaagtac 60 gggagagcaa tgcgtattgg gacaaatcat ggaagtcttt ctgaccgtat catgagctat 120 tacggggatt ctccccgagg aatggttgaa tctgcgtttg agtttgcaag aatatgtcgg 180 gaattagact atcacaactt tgttttctca atgaaagcga gcaacccagt gatcatggtc 240 caggcgtacc gtttacttgt ggctgagatg tatgttcatg gatgggatta tcctttgcat 300 ttgggagtta ctgatgcagg agaaggcgaa gatggacgga tgaaatctgc gattggaatt 360 gggacgcttc ttcaggacgg gctcggtgac acaat 395 <210> SEQ ID NO 19 <211> LENGTH: 412 <212> TYPE: DNA <213> ORGANISM: Arabidopsis thaliana <400> SEQUENCE: 19 atgctggagg ccttcttgtg gatggactag gtgatggcgt aatgctcgaa gcacctgacc 60 aagattttga ttttcttagg aatacttcct tcaacttatt acaaggatgc agaatgcgta 120 acactaagac ggaatatgta tcgtgcccgt cttgtggaag aacgcttttc gacttgcaag 180 aaatcagcgc cgagatccga gaaaagactt cccatttacc tggcgtttcg atcgcaatca 240 tgggatgcat tgtgaatgga ccaggagaaa tggcagatgc tgatttcgga tatgtaggtg 300 gttctcccgg aaaaatcgac ctttatgtcg gaaagacggt ggtgaagcgt gggatagcta 360 tgacggaggc aacagatgct ctgatcggtc tgatcaaaga acatggtcgt tg 412 <210> SEQ ID NO 20 <211> LENGTH: 1172 <212> TYPE: DNA <213> ORGANISM: Arabidopsis thaliana <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (1..1172) <223> OTHER INFORMATION: unsure at all n locations <400> SEQUENCE: 20 gggtatgcca ttcaaggatc tggcaactgt tgattcaatc ttattaaaga gagctaccgc 60 ctgtagatga tcaagtggct cgtttggctc taaaacggtt gattgatgtc agtatgggag 120 ttatagcacc tttatcagag caactaacaa agccattgcc caatgccatg gttcttgtca 180 acctcaagga actatctggt ggcgcttaca agcttctccc tgaaggtaca cgcttggttg 240 tctctctacg aggcgatgag ccttacgagg agcttgaaat actcaacaac attgatgcta 300 cgatgattct ccatgatgta cctttcactg aagacaaagt tagcagagta catgcagctc 360 ggaggctatt cgagttctta tccgagaatt cagttaactt tcctgttatt catcacataa 420 acttcccaac cggaatccac agagacgaat tggtgattca tgcagggaca tatgctggag 480 gccttcttgt ggatggacta cgtgatggcg taatgctcga agcacctgac caagattttg 540 attttcttag gaatacttcc ttcaacttat tacaaggatg cagaatgcgt aacactaaga 600 cggaatatgt atcgtgcccg tcttgtggaa gaacgctttt cgacttgcaa gaaatcagcg 660 ccgagatccg agaaaagact tcccatttac ctggcgtttc gatcgcaatc atgggatgca 720 ttgtgaatgg accaggagaa atggcagatg ctgatttcgg atatgtaggt ggttctcccg 780 gaaaaatcga cctttatgtc ggaaagacgg tggtgaagcg tgggatagct atgacggagg 840 caacagatgc tctgatcggt ctgatcaaag aacatggtcg ttgggtcgac ccgcccgtgg 900 ccgatgagta gatttcaaaa cggagaaaga tgggtgggcc attctttgaa aactgtgaga 960 ggagatatat atatttgtgt gtgtatatca tctgtttgtt gtgtattgca tcattcattt 1020 tggacaaatg tccaaattct cttaagttga taaaagttct taggccaaat taaatttaat 1080 ataaaaaaaa aaaaaaaaag gcnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1140 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nn 1172 <210> SEQ ID NO 21 <211> LENGTH: 584 <212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 21 caggttaatt aattcctgta cgccgtcggt ttcgggtact cgtttaattt cttcccgacc 60 acggttgatg gcaatgtaac cggcttgttt acccacatag ccatagtcgg catcggccat 120 ttccccgggg ccattgacaa tacagcccat gacggcgatg tctaaacccg ttagatgttt 180 agtggcttct cggacttcat gtaacacgtc ttccaagttg aacaacgtgc ggccacagga 240 aggacaggcc acatattcca ccatggtttt ccgcaaaccc agcgcctgga gaatgctgta 300 gcaaacggga atttcttttt cgggggcttc ggtgagggat acccggatag tatcgccaat 360 gccatcagct aaaagggtgg caatgccagc ggtggattta atgcggccat attccccatc 420 cccggcttcg gtaaccccta gatggagggg ataatccatg cccaactcgt tcatacgttt 480 caccatgagg cgataggcgg ccaacattac cggtacccgg gacgctttca tggaaacgac 540 taggttgcgg aaatctaaag actcacaaat tttgatgaat tcca 584 <210> SEQ ID NO 22 <211> LENGTH: 670 <212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 22 caggtcgact ctagaggatc ggcgttaacc atggttctct ctccgaaaga atgcttttac 60 ctacttttta cccccgaggg catggtgcaa tcggccctgg aattcatcaa aatttgtgag 120 tccttagatt tccgcaacct agtcgtttcc atgaaagcgt cccgggtacc ggtaatgttg 180 gccgcctatc gcctcatggt gaaacgtatg gacgagttgg gcatggatta tcccctccat 240 ctaggggtta ccgaagccgg ggatggggaa tatggccgca ttaaatccac cgctggcatt 300 gccacccttt tagctgatgg cattggcgat actatccggg tatccctcac cgaagccccc 360 gaaaaagaaa ttcccgtttg ctacagcatt ctccaggcgc tgggtttgcg gaaaaccatg 420 gtggaatatg tggcctgtcc ttcctgtggc cgcacgttgt tcaacttgga agacgtgtta 480 catgaagtcc gagatgccac taaacatcta acgggtttag actttcgccg tcatgggctg 540 tattgtcaat ggccccgggg caatggccga tgccgactat ggctatgtgg gtaaacaagc 600 cggttacatt gccatcaacc gtggtcggga agaaattaaa cgagtacccg aaaccgacgg 660 cgtacaggaa 670 <210> SEQ ID NO 23 <211> LENGTH: 596 <212> TYPE: DNA <213> ORGANISM: Zea mays <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (1..596) <223> OTHER INFORMATION: unsure at all n locations <400> SEQUENCE: 23 caggtcgact ctagaggatc ggcgttaacc atggttctct ctccgaaaga atgcttttac 60 ctacttttta cccccgaggg catggtgcaa tcggccctgg aattcatcaa aatttgtgag 120 tccttagatt tccgcaacct agtcgtttcc atgaaagcgt cccgggtacc ggtaatgttg 180 gccgcctatc gcctcatggt gaaacgtatg gacgagttgg gcatggatta tcccctccat 240 ctaggggtta ccgaagccgg ggatggggaa tatggccgca ttaaatccac cgctggcatt 300 gccacccttt tagctgatgg cattggcgat actatccggg tatccctcac cgaagccccc 360 gaaaaagaaa ttcccgtttg ctacagcatt ctccaggcgc tgggtttgcg gaaaaccatg 420 gtggaatatg tggcctgtcc ttcctgtggc cgcacgttgt tcaacttgga agacgtgtta 480 catgaagtcc gagatgccac taaacatcta acgtgtttag actttcgncg tcatgtgctg 540 tattgtcaat ggccccggtg caatggccga tgccgactat ggctatgtgg gtaaac 596 <210> SEQ ID NO 24 <211> LENGTH: 403 <212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 24 cagacaagga ggaggaaaac tcgaactgtg atggtgggga atgtgccact tgggagtgat 60 caccccataa ggattcaaac catgacgact tcagatacca aggatgttgc gaaaacagta 120 gaggaggtga tgaggatagc agataaagga gctgatcttg ttagaataac agtccagggt 180 aggaaggaag ctgatgcctg ctttgagatc aagaacactc tggttcagaa gaattacaac 240 attccactag tggccgatat tcattttgct cctacggtag ctctaaaggt ggcagaatgt 300 tttgacaaaa ttcgtgtgaa cccaggaaat tttgctgatc gtcgtgctca atttgaaaag 360 ctggaatata ctgacgacga ctaccaaaaa gagctagagc ata 403 <210> SEQ ID NO 25 <211> LENGTH: 293 <212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 25 cagacaaggc ggaggaaaac tcgaactgtg atggtgggga atgtgccact tggcagtgat 60 caccccataa ggattcaaac catgacgact tcagatacca aggatgttgc gaaaacagta 120 gaggaggtga tgaggatagc agataaagga gctgatcttg ttagaataac agtccagggt 180 aggaaggaag ctgatgcctg ctttgagatc aagaacactc tggttcagaa gaattacaac 240 attccactag tggccgatat tcattttgct cctacggtag ctctaagggt ggc 293 <210> SEQ ID NO 26 <211> LENGTH: 456 <212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 26 cagacaaggc ggaggaaaac tcgaactgtg atggtgggga atgtgccact tggcagtgat 60 caccccataa ggattcaaac catgacgact tcagatacca aggatgttgc gaaaacagta 120 gaggaggtga tgaggattgc agataaagga gctgatcttg ttagaataac agtccagggt 180 aggaaggaag ctgatgcctg ctttgagatc aagaacaact ctggttcaga agaattacaa 240 ccttccacta gtggacctga tattcatttt gctccttcag tagctttaaa ggtggcagaa 300 tgtttggaca aattaattga aacacacaat ttcttgttga tagtgtacct taattagaaa 360 agctggaatt taccggctac gacttccata aagcgcttgg gcttgtttaa caattggttt 420 ttaccttaat cgaatatttc acagaaattt gaattt 456 <210> SEQ ID NO 27 <211> LENGTH: 619 <212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 27 caccgaaggt ttctaattta tttctcagat ctcaataaat gtacaaaatg tgtagggatg 60 atgtacattg tatgctcagt tcctgcattg cgtgtttcgc tttacagaat atataaacta 120 cagacttggc tacagcctac agccctactc ctcggcagga ggatccaccc atcggccatg 180 gtccttgatc agctggatca aggcgtcagt tgcaccttcc atggcgatgg cgcgctgcac 240 aacggtcttg ccaacataaa ggtcgatctt tccgggagcg cctccaacgt atccgaaatc 300 ggcatcagcc atctctcctg gtccattgac aatacaaccc atgatagcga tcgaaacacc 360 tggcagatga gaggtctttt ctctaatctc agcgctgatt tcctgaaggt caaagagtgt 420 tcggccgcag gaaggacaag acacatattc agtttttgtg ttgcgcatcc tgcaaccttg 480 gagcaagttg aaagatgtgt ccctcaggaa ctcaaattcc tggtcagcag cttcaaggaa 540 tacaccatca ccaagaccat cgactaagag agcaccaacg ttggccccag caccaatgac 600 aagaccatct ctgtcaatg 619 <210> SEQ ID NO 28 <211> LENGTH: 422 <212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 28 tcgcttgcac ttgggtgtta cagaagctgg agagggtgaa gatggaagga tgaaatctgc 60 tattggcatt gggacactgc taatggatgg tttgggtgat acaatccgtg tctccctcac 120 agaaccacca gaagaagaga ttgatccttg ccaaaggttg gcaaatcttg ggacgcaggc 180 cgcaaacctt caaattgggg tggccccatt tgaagaaaag cacaggcgct attttgattt 240 ccagcgtagg agtggtcaat tgcctttgca gaaggaggga ggcgatagtt gactacagaa 300 atgtcctgca tcgtgatggt atctgactga tggcagtttc cctggatcag ttgaaggctc 360 ctgatctcct ttataggtat attgcagcaa agcttgcgga tggcatgcct ttcaaggatc 420 tg 422 <210> SEQ ID NO 29 <211> LENGTH: 430 <212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 29 tcgcttgcac ttgggtgtta cagaagctgg agagggtgaa gatggaagga tgaaatctgc 60 tattggcatt gggacactgc taatggatgg tttgggtgat acaatccgtg tctccctcac 120 agaaccacca gaagaagaga ttgatccttg ccaaaggttg gcaaatcttg ggacgcaggc 180 tgcaaacctt caaattgggg tggccccatt tgaagaaaag cacaggcgtt attttgattt 240 ccagcgtagg agtggtcaat tgcctttgca gaaggagggt gaggaagttg actacagaaa 300 tgtcctgcat cgtgatggta tctgtactga tggcagtttc cctggatcag ttgaaggctc 360 ctgatctcct ttataggtct cttgcagcaa agcttgcggt tggcatgcct ttcaaggatc 420 tggctactgt 430 <210> SEQ ID NO 30 <211> LENGTH: 528 <212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 30 gacaggcagg gtgcatgctg ctaggaggtt atttgagtac ttacaggcca atggcttgaa 60 cttccctgta attcatcaca taaatttccc tgaaaccatt gacagagatg gtcttgtcat 120 tggggctggg gccaacgttg gtgctctctt agtcgatggt cttggtgatg gtgtattcct 180 tgaggcggct gaccaggaat ttgagttcct gagggacaca tctttcaact tgctccaagg 240 ttgcaggatg cgcaacacaa aaactgaata tgtgtcttgt ccttcctgcg gccgaacact 300 ctttgacctt caggaaatca gcgctgagat tagcgaaaag acctctcatc tgccacgtgt 360 ttcgatcgct atcatgggtt gtattgtcaa tggaccagga gcgctggctg atgccgattt 420 cggatacgtt ggcggcgctc ccggaaagat cgacctttat attggcacga ccgttatgca 480 gcgcgccatc gccatggacg gtgcaactga cgccttgatc cagctgat 528 <210> SEQ ID NO 31 <211> LENGTH: 303 <212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 31 ggggccaacg ttggtgctct cttagtcgat ggtcttggtg atggtgtatt ccttgaggcg 60 gctgaccagg aatttgagtt cctgagggac acatctttca acttgctcca aggttgcagg 120 atgcgcaaca caaaaactga atatgtgtct tgtccttcct gcggccgaac actctttgac 180 cttcaggaaa tcagcgctga gattagagaa aagacctctc atctgccacg tgtttcgatc 240 gctatcatgg gttgtattgt caatggacca ggagagatgg ctgatgccga tttcggatac 300 gtt 303 <210> SEQ ID NO 32 <211> LENGTH: 613 <212> TYPE: DNA <213> ORGANISM: Zea mays <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (1..613) <223> OTHER INFORMATION: unsure at all n locations <400> SEQUENCE: 32 cgagatggcg ttccatgccn ggcccttcct cctcttcctc ttcttctgcc cccccgctgg 60 cttggaaaag ggagagaaac tcgcgcactc ggttatcgaa gggaggagcg cgggcgaggg 120 tgaggtttcg cccacacgga gctgcgaggt gtttgtagga tctcctaggt gagcccctgc 180 tgcttggaga cagccatggc caccggcgtg gctccagctc ctctcccaca tgtcagagtg 240 cgtcatgggg gcgtcgggtt caccaggagc gtcgattttg cgaaggtctt gtctgctccc 300 ggtgccggca cgatgagagc aagctcctct agaggcaggg cgctcgtggc gaagagctct 360 agtactggct cggagaccat ggagctcgag ccatcttcag aaggaagccc acttttagta 420 cccaggcaga agtactgtga atcaacacac cagacaagga ggaggaaaac tcgaactgtg 480 atggtgggga atgtgccact tggcagtgat catcccataa ggattcaaac catgacgact 540 tcagatacca aggatgttgc aaaaacagta gaggaggtga tgaggatagc agataaagga 600 gctgatcttg tta 613 <210> SEQ ID NO 33 <211> LENGTH: 464 <212> TYPE: DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 33 agagcatgaa atcttctgcg aggaaaaggg tgtcaattat cacgaactca aatcctggcc 60 aagatattgc tgaacttcaa cctgcatccc caggaagccc tcttttggtt cctaggcaaa 120 agtattgtga atcattgcac aaacccatca ggagaaaaac aagcacagta atggttggta 180 acgtggctat tggtagcgag catcctataa gaattcagac catgactaca actgacacta 240 aggatgttgc tgggacagtt gaacaggtga tgagaatagc agataaagga gctgatattg 300 tacggataac agttcaaggg aagaaagaag ctgatgcttg ttttgagatt aaaaacaccc 360 ttgtgcagaa aaattacaac atacccgtgg tggctgatat tcattttgct ccctctgttg 420 ctttgcgggt agctgaatgc tttgataaga ttcgtgtaaa ccct 464 <210> SEQ ID NO 34 <211> LENGTH: 705 <212> TYPE: DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 34 gtagctgaat gctttgataa gattcgtgta aaccctggaa attttgctga tagacgggct 60 caatttgaaa cattagagta cacagaagaa gactatcaga aagaacttga gcatattgaa 120 aaggttttca caccattggt tgagaaatgt aagaaatatg ggagagcaat gcgcattggg 180 acaaaccatg gaagtctttc tgatcgtata atgagctact atggagactc gcctagggga 240 atggtagaat ctgcttttga atttgcaagg atatgccgaa agttagacta tcacaatttt 300 gttttttcta tgaaagcaag caacccagtt atcatggttc aggcataccg cttacttgtg 360 gctgaaatgt atgtccaagg ctgggattat ccattacact tgggtgttac tgaagctgga 420 gaaggtgagg atgggaggat gaagtctgca ataggcattg gaactcttct tcaggatgga 480 ttgggtgata caattagggt ttctctcaca gaaccaccag aggaggagat agacccttgc 540 agaaggttgg caaatcttgg aatgatagct tctgaactcc agaagggggt ggaacctttt 600 gaagaaaagc acagacatta ttttcgactt tcagcgccga tctggtcaat tgccagtgca 660 aaaagagggt gaggaggtgg attacagagg tgtactgcac cgtga 705 <210> SEQ ID NO 35 <211> LENGTH: 564 <212> TYPE: DNA <213> ORGANISM: Glycine max <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (1..564) <223> OTHER INFORMATION: unsure at all n locations <400> SEQUENCE: 35 aagcncggaa ttcggctcga gaggaactca aatcctggcc aagatattgc tgaacttcaa 60 cctgtatccc caggaagccc tcttttggtt cctaggcaaa agtattgtga atgattacac 120 aaaactgtca ggagaaaaac aaacacagtg atggttggta acgtggctat tggtagcgag 180 catcctataa gaattcagac catgactacg actgacacta aggatgttgc tgggacagtt 240 gaacaggtga tgagaatagc agataaagga gctgatattg tacggataac agttcaaggg 300 aagaaagaag ctgatgcttg ttttgagatt aaaaacaccc ttgttcagaa aaattacaac 360 atactcgtgg tggctgatat tcattttgct ccctctggtg ctttgcgggt agctgaatgc 420 tttgataaga ttcgtgtaaa ccctggaaat tttgctgata gacgggctca atttgaaaca 480 ttagagtaca cagatgatga ctatcagaaa gaacttgagc atattgaaaa ggttttcaca 540 ccattggttg agaaatgtaa gaaa 564 <210> SEQ ID NO 36 <211> LENGTH: 511 <212> TYPE: DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 36 aaaccatgga agtctttctg atcgtataat gagctactat ggagactcgc ctaggggaat 60 ggtagaatct gcttttgaat ttgcaaggat atgccgaaag ttagactatc acaattttgt 120 tttttctatg aaagcaagca acccagttat catggttcag gcataccgct tacttgtggc 180 tgaaatgtat gtccaaggct gggattatcc attacacttg ggtgttactg aagctggaga 240 aggtgaggat gggaggatga agtctgcaat aggcattgga actcttcttc aggatggatt 300 gggtgataca attagggttt ctctcacaga accaccagag gaggagatag acccttgcag 360 aaggttggca aatcttggaa tgatagcttc tgaactccag aagggggtgg aaccttttga 420 agaaaagcac agacattatt ttgactttca gcgccgatct ggtcaattgc cagtgcataa 480 agagggtgag gaggtggatt acagaggtgt a 511 <210> SEQ ID NO 37 <211> LENGTH: 498 <212> TYPE: DNA <213> ORGANISM: Glycine max <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (1..498) <223> OTHER INFORMATION: unsure at all n locations <400> SEQUENCE: 37 cggaggtggc gtgaatgctt tgataagatt cgtgtaaacc ctggaaattt tgctgataga 60 cgggctcaat ttgaaacatg agagtggaca naataagact atgagaaaga acttgagcat 120 attgaaaagg ttttcacacc attggttgag aaatgtaaga aatatgggag agcaatgcgc 180 attgggacaa accatggaag tctttctgat cgtataatga gctactatgg agactcgcct 240 aggggaatgg tagaatctgc ttttgaattt gcaaggatat gccgaaagtt agactatcac 300 aattttgttt tttctatgaa agcaagcaac ccagttatca tggttcaggc ataccgctta 360 cttgtggctg aaatgtatgt ccaaggctgg gattatccat tacacttggg tgttactgaa 420 gctggagaag gtgaggatgg gaggatgaag tctgcaatag gcattggaac tcttcttcag 480 gatggattgg gtgataca 498 <210> SEQ ID NO 38 <211> LENGTH: 440 <212> TYPE: DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 38 gtagctgaat gctttgataa gattcgtgta aaccctggaa attttgttga tagacgggct 60 caatttgaaa cattagagta cacagaagaa gactatcata aagaacttga gcatattgaa 120 aaggttttca caccattggt tgagaaatgt aagaaatatg ggagagcaat gcgcattggg 180 acaaaccatg gaagtctttc tgatcgtata atgagctact atggagactc gcctagggga 240 atggtagaat ctgcttttga atttgcaagg atatgccgaa agttagacta tcacaatttt 300 gttttttcta tgaaagcaag caacccagtt atcatggttc aggcataccg cttacttgtg 360 gctgaaatgt atgttcaagg ctgggattat ccattacact tgggtgttac tgaagctgga 420 aaaagtgagg atgggaggat 440 <210> SEQ ID NO 39 <211> LENGTH: 353 <212> TYPE: DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 39 aattcggctc gagaggaact caaatcctgg ccaagatatt gctgaacttc aacctgcatc 60 cccaggaagc cctcttttgg ttcctaggca aaagtattgt gaatcattac acaaaactgt 120 caggagaaaa acaaacacag tgatggttgg taacgtggct attggtagcg agcatcctat 180 aagaattcag accatgacta cgactgacac taaggatgtt gctgggacag ttgaacaggt 240 gatgagaata gcagataaag gagctgatat tgtacggata acagttcaag ggaagaaaga 300 agctgatgct tgttttgaga ttaaaaacac ccttgttcaa aaaaattaca aca 353 <210> SEQ ID NO 40 <211> LENGTH: 577 <212> TYPE: DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 40 gatgtttttg tcgtgtattc tattcctatt gcattcagct cactgatttc aattacaaag 60 tcaattttgt aaatcagagg cagagagagt tgtaaagagc ctctgaattt tgatcacacc 120 acacccttct tctcatctcc accagaaatg gctaccggag ctgctgtgcc aactacgttt 180 tctaccctca agacatggga ttccagtttg gggtttgcaa aaaacataga ttttgtgaga 240 gtttccgata tgaagagcat gaaatcttct gcgaggaaaa gggtgtcaat tatcaggaac 300 tcaaatcctg gccaagatat tgctgaactt caacctgcat ccccaggaag ccctcttttg 360 gttcctaggc aaaagtattg tgaatcattg cacaaaccca tcaggagaaa aacaagcaca 420 gtaatggttg gtaacgtggc tattggtagc gagcatccta taagaattca gaccatgact 480 acaactgaca ctaaggatgt tgctgggaca gttgaaccgg tgatgagaat agcagataaa 540 ggagctgata ttgtacggat aacagttcaa gggaaga 577 <210> SEQ ID NO 41 <211> LENGTH: 551 <212> TYPE: DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 41 tggtgctggt tctgatgctg gagcccttct ggtggatggg cttggagatg gacttctttt 60 ggaagcgcca gacaaggatt ttgaatttat tagaaacact tctttcaatt tgttgcaagg 120 ctgcagaatg agaaatacaa agacagagta tgtctcatgt ccatcctgtg gcagaacatt 180 gtttgatctt caagaagtaa gtgcacaaat tcgggagaag acatcacacc tccccggtgt 240 ttcgattgca atcatgggat gcattgtaaa tggaccaggg gagatggctg atgcagactt 300 tgggtatgtg ggaggcactc ccgggaagat tgacctctat gttgggaaga ctgtggtgaa 360 gcgtggaatt gcaatggagc atgcaaccaa tgccttgatc gatctaataa aagaacatgg 420 acgatgggtg gaccctcctg ccgaggagta aaagcaagag cttaattttg agattggcat 480 tcaaggccat agtaagatga gcattgtcat atccaattat tggacacatg taatataagc 540 atacactcaa t 551 <210> SEQ ID NO 42 <211> LENGTH: 869 <212> TYPE: DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 42 gaagcatagt agcatcaatg ccttccttat acagaagact aaaattagca gagtgcatgc 60 ggccaggcgg ttatttgagt acctatccga caattctcta aacttccctg ttattcacca 120 tattcagttc ccaaatggga ttcacagaga tgacttggta attggtgctg gttctgatgc 180 tggagccctt ctggtggatg ggcttggaga tggacttctt ttggaagcgc cagacaagga 240 ttttgaattt attagaaaca cttctttcaa tttgttgcaa ggctgcagaa tgagaaatac 300 aaagacagag tatgtctcat gtccatcctg tggcagaaca ttgtttgatc ttcaagaagt 360 aagtgcacaa attcgggaga agacatcaca cctccctggt gtttcgattg caatcatggg 420 atgcattgta aatggaccag gggagatggc tgatgcagac tttgggtatg tgggaggcac 480 tcccgggaag attgacctct atgttgggaa gactgtggtg aagcgtggaa ttgcaatgga 540 gcatgcaacc aatgccttga tcgatctaat aaaagaacat ggacgatggg tggaccctcc 600 tgccgaggag taaaagcaag agcttaattt tgagattggc attcaaggcc atagtaagat 660 gagcattgtc atatccaatt attgtacaca tgtaatataa gataacactc aatgcttaag 720 tttgagccta gttttaagtt ccttttgaga aagatcccaa ttaaagcttg ttgtgaggaa 780 atcgacagct agaacatgta tacagataac agtgtattgc tttgccccat cagccatcaa 840 taataatgag aatctcttag aatagtgcc 869 <210> SEQ ID NO 43 <211> LENGTH: 291 <212> TYPE: DNA <213> ORGANISM: Glycine max <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (1..291) <223> OTHER INFORMATION: unsure at all n locations <400> SEQUENCE: 43 gangnactca aatcctgggc caagatattg ctgaacttca nccctgcatc cccaggnngc 60 cctcttttgg ttcctaggca aaagtattgt gaatcattnc cacaaaactg nccagganaa 120 aaacaaacac agtgatggtt ggtaacgtgg ctattggtag cgagcatcct ataagaattc 180 agaccatgac tacgacngac actaaggatg ttgctgggac agtngaacng gtgatgagaa 240 tagcagataa aggagctgat attgtacgga taacagttca agggaagaaa g 291 <210> SEQ ID NO 44 <211> LENGTH: 388 <212> TYPE: DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 44 cccggtatat ggttcaggca taccgtttac ttgtggctga aatgtatgtc caaggctggg 60 attatccatt acacttgggt gttactgaag ctggagaagg tgaggatggg aggatgaagt 120 ctgcaattgg cattggaact cttcttcagg atggattggg tgatacaatt agggtttctc 180 tcacagaacc accagaagag gagatagatc cttgcagaag gttggcaaat cttggaatga 240 gagcttctga actccagaag ggggtggaac cttttgaaga aaagcacaga cattattttg 300 acttccagcg ccgatctggt caattgccag tgcaaaaaga gggtgaggag gtggattaca 360 gaggtgcact gcaccgtgac ggttctgt 388 <210> SEQ ID NO 45 <211> LENGTH: 211 <212> TYPE: DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 45 cccggttatc atggcgcagg cataccgctt acttgtggct gaaatgtatg tccaaggctg 60 ggattatcca ttacacttgg gtgttactga agctggagga ggtgaggatg acaggatgaa 120 gtctgcaatt ggcattggaa ctcttcttca ggatggattg ggtgatacaa ttagggtgtc 180 tcgcacagaa ccaccagaag aggagataga t 211 <210> SEQ ID NO 46 <211> LENGTH: 276 <212> TYPE: DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 46 tgggcttgga gatggactac ttttggaagc cccggacaag gattttgaat ttattagaaa 60 cacttctttc aatttgttgc aaggctgcag aatgagaaat acaaagacag agtatgtctc 120 atgtccatcc tgtggcagaa cattgtttga tcttcaagaa gtaagtgcac aaattcggga 180 gaagacatca cacctccctg gtgtttcgat tgcaatcatg ggatgcattg taaatggacc 240 aggggagatg gctgatgcag actttgggta tgtggg 276 <210> SEQ ID NO 47 <211> LENGTH: 399 <212> TYPE: DNA <213> ORGANISM: Brassica napus <400> SEQUENCE: 47 cccacgcgtc cgcagggatt cacagggacg agttggtgat ccacgcaggg acatacgctg 60 gggcacttct agtggatgga cttggagatg gtgtaatgct agaagcacct gatcaagact 120 tcgagtttct taggaacact tctttcaact tgttacaagg ctgcaggatg cgtaacacca 180 agacggaata cgtatcgtgc ccgtcttgtg gaagaactct gttcgacttg caagaaatca 240 gcgctgagat cagagaaaag acttcgcatt tgcctggcgt ttcgattgca ataatgggtt 300 gcattgtgaa tggacctggc gaaatggctg atgctgattt cggttatgta ggcggttctc 360 ccgggaaaat cgacctttac gttggaaaga cggtggtca 399 <210> SEQ ID NO 48 <211> LENGTH: 740 <212> TYPE: PRT <213> ORGANISM: Arabidopsis thaliana <400> SEQUENCE: 48 Met Ala Thr Gly Val Leu Pro Ala Pro Val Ser Gly Ile Lys Ile Pro 1 5 10 15 Asp Ser Lys Val Gly Phe Gly Lys Ser Met Asn Leu Val Arg Ile Cys 20 25 30 Asp Val Arg Ser Leu Arg Ser Ala Arg Arg Arg Val Ser Val Ile Arg 35 40 45 Asn Ser Asn Gln Gly Ser Asp Leu Ala Glu Leu Gln Pro Ala Ser Glu 50 55 60 Gly Ser Pro Leu Leu Val Pro Arg Gln Lys Tyr Cys Glu Ser Leu His 65 70 75 80 Lys Thr Val Arg Arg Lys Thr Arg Thr Val Met Val Gly Asn Val Ala 85 90 95 Leu Gly Ser Glu His Pro Ile Arg Ile Gln Thr Met Thr Thr Ser Asp 100 105 110 Thr Lys Asp Ile Thr Gly Thr Val Asp Glu Val Met Arg Ile Ala Asp 115 120 125 Lys Gly Ala Asp Ile Val Arg Ile Thr Val Gln Gly Lys Lys Glu Ala 130 135 140 Asp Ala Cys Phe Glu Ile Lys Asp Lys Leu Val Gln Leu Asn Tyr Asn 145 150 155 160 Ile Pro Leu Val Ala Asp Ile His Phe Ala Pro Thr Val Ala Leu Arg 165 170 175 Val Ala Glu Cys Phe Asp Lys Ile Arg Val Asn Pro Gly Asn Phe Ala 180 185 190 Asp Arg Arg Ala Gln Phe Glu Thr Ile Asp Tyr Thr Glu Asp Glu Tyr 195 200 205 Gln Lys Glu Leu Gln His Ile Glu Gln Val Phe Thr Pro Leu Val Glu 210 215 220 Lys Cys Lys Lys Tyr Gly Arg Ala Met Arg Ile Gly Thr Asn His Gly 225 230 235 240 Ser Leu Ser Asp Arg Ile Met Ser Tyr Tyr Gly Asp Ser Pro Arg Gly 245 250 255 Met Val Glu Ser Ala Phe Glu Phe Ala Arg Ile Cys Arg Lys Leu Asp 260 265 270 Tyr His Asn Phe Val Phe Ser Met Lys Ala Ser Asn Pro Val Ile Met 275 280 285 Val Gln Ala Tyr Arg Leu Leu Val Ala Glu Met Tyr Val His Gly Trp 290 295 300 Asp Tyr Pro Leu His Leu Gly Val Thr Glu Ala Gly Glu Gly Glu Asp 305 310 315 320 Gly Arg Met Lys Ser Ala Ile Gly Ile Gly Thr Leu Leu Gln Asp Gly 325 330 335 Leu Gly Asp Thr Ile Arg Val Ser Leu Thr Glu Pro Pro Glu Glu Glu 340 345 350 Ile Asp Pro Cys Arg Arg Leu Ala Asn Leu Gly Thr Lys Ala Ala Lys 355 360 365 Leu Gln Gln Gly Ala Pro Phe Glu Glu Lys His Arg His Tyr Phe Asp 370 375 380 Phe Gln Arg Arg Thr Gly Asp Leu Pro Val Gln Lys Glu Gly Glu Glu 385 390 395 400 Val Asp Tyr Arg Asn Val Leu His Arg Asp Gly Ser Val Leu Met Ser 405 410 415 Ile Ser Leu Asp Gln Leu Lys Ala Pro Glu Leu Leu Tyr Arg Ser Leu 420 425 430 Ala Thr Lys Leu Val Val Gly Met Pro Phe Lys Asp Leu Ala Thr Val 435 440 445 Asp Ser Ile Leu Leu Arg Glu Leu Pro Pro Val Asp Asp Gln Val Ala 450 455 460 Arg Leu Ala Leu Lys Arg Leu Ile Asp Val Ser Met Gly Val Ile Ala 465 470 475 480 Pro Leu Ser Glu Gln Leu Thr Lys Pro Leu Pro Asn Ala Met Val Leu 485 490 495 Val Asn Leu Lys Glu Leu Ser Gly Gly Ala Tyr Lys Leu Leu Pro Glu 500 505 510 Gly Thr Arg Leu Val Val Ser Leu Arg Gly Asp Glu Pro Tyr Glu Glu 515 520 525 Leu Glu Ile Leu Lys Asn Ile Asp Ala Thr Met Ile Leu His Asp Val 530 535 540 Pro Phe Thr Glu Asp Lys Val Ser Arg Val His Ala Ala Arg Arg Leu 545 550 555 560 Phe Glu Phe Leu Ser Glu Asn Ser Val Asn Phe Pro Val Ile His His 565 570 575 Ile Asn Phe Pro Thr Gly Ile His Arg Asp Glu Leu Val Ile His Ala 580 585 590 Gly Thr Tyr Ala Gly Gly Leu Leu Val Asp Gly Leu Gly Asp Gly Val 595 600 605 Met Leu Glu Ala Pro Asp Gln Asp Phe Asp Phe Leu Arg Asn Thr Ser 610 615 620 Phe Asn Leu Leu Gln Gly Cys Arg Met Arg Asn Thr Lys Thr Glu Tyr 625 630 635 640 Val Ser Cys Pro Ser Cys Gly Arg Thr Leu Phe Asp Leu Gln Glu Ile 645 650 655 Ser Ala Glu Ile Arg Glu Lys Thr Ser His Leu Pro Gly Val Ser Ile 660 665 670 Ala Ile Met Gly Cys Ile Val Asn Gly Pro Gly Glu Met Ala Asp Ala 675 680 685 Asp Phe Gly Tyr Val Gly Gly Ser Pro Gly Lys Ile Asp Leu Tyr Val 690 695 700 Gly Lys Thr Val Val Lys Arg Gly Ile Ala Met Thr Glu Ala Thr Asp 705 710 715 720 Ala Leu Ile Gly Leu Ile Lys Glu His Gly Arg Trp Val Asp Pro Pro 725 730 735 Val Ala Asp Glu 740 <210> SEQ ID NO 49 <211> LENGTH: 603 <212> TYPE: PRT <213> ORGANISM: Oryza sativa <400> SEQUENCE: 49 Met Val Gly Asn Val Pro Leu Gly Ser Asp His Pro Ile Arg Ile Gln 1 5 10 15 Thr Met Thr Thr Ser Asp Thr Lys Asp Val Ala Lys Thr Val Glu Glu 20 25 30 Val Met Arg Ile Ala Asp Lys Gly Ala Asp Phe Val Arg Ile Thr Val 35 40 45 Gln Gly Arg Lys Glu Ala Asp Ala Cys Phe Glu Ile Lys Asn Thr Leu 50 55 60 Val Gln Lys Asn Tyr Asn Ile Pro Leu Val Ala Asp Ile His Phe Ala 65 70 75 80 Pro Thr Val Ala Leu Arg Val Ala Glu Cys Phe Asp Lys Ile Arg Val 85 90 95 Asn Pro Gly Asn Phe Ala Asp Arg Arg Ala Gln Phe Glu Gln Leu Glu 100 105 110 Tyr Thr Glu Asp Asp Tyr Gln Lys Glu Leu Glu His Ile Glu Lys Val 115 120 125 Pro Asn Ile Ser Leu Phe Ser Val Asn Leu Val Phe Ser Pro Leu Val 130 135 140 Glu Lys Cys Lys Gln Tyr Gly Arg Ala Met Arg Ile Gly Thr Asn His 145 150 155 160 Gly Ser Leu Ser Asp Arg Ile Met Ser Tyr Tyr Gly Asp Ser Pro Arg 165 170 175 Gly Met Val Glu Ser Ala Leu Glu Phe Ala Arg Ile Cys Arg Lys Leu 180 185 190 Asp Phe His Asn Phe Val Phe Ser Met Lys Ala Ser Asn Pro Val Ile 195 200 205 Met Val Gln Ala Tyr Arg Leu Leu Val Ala Glu Met Tyr Asn Leu Gly 210 215 220 Trp Asp Tyr Pro Leu His Leu Gly Val Thr Glu Ala Gly Glu Gly Glu 225 230 235 240 Asp Gly Arg Met Lys Ser Ala Ile Gly Ile Gly Thr Leu Leu Met Asp 245 250 255 Gly Leu Gly Asp Thr Ile Arg Val Ser Leu Thr Glu Pro Pro Glu Glu 260 265 270 Glu Ile Asp Pro Cys Arg Arg Leu Ala Asn Leu Gly Thr His Ala Ala 275 280 285 Asp Leu Gln Ile Gly Val Ala Pro Phe Glu Glu Lys His Arg Arg Tyr 290 295 300 Phe Asp Phe Gln Arg Arg Ser Gly Gln Leu Pro Leu Gln Lys Glu Ala 305 310 315 320 Pro Glu Leu Leu Tyr Arg Ser Leu Ala Ala Lys Leu Val Val Gly Met 325 330 335 Pro Phe Lys Asp Leu Ala Thr Val Asp Ser Ile Leu Leu Lys Glu Leu 340 345 350 Pro Pro Val Glu Asp Ala Gln Ala Arg Leu Ala Leu Lys Arg Leu Val 355 360 365 Asp Ile Ser Met Gly Val Leu Thr Pro Leu Ser Glu Gln Leu Thr Lys 370 375 380 Pro Leu Pro His Ala Ile Ala Leu Val Asn Val Asp Glu Leu Ser Ser 385 390 395 400 Gly Ala His Lys Leu Leu Pro Glu Gly Thr Arg Leu Ala Val Thr Leu 405 410 415 Arg Gly Asp Glu Ser Tyr Glu Gln Leu Asp Leu Leu Lys Gly Val Asp 420 425 430 Asp Ile Thr Met Leu Leu His Ser Val Pro Tyr Gly Glu Glu Lys Thr 435 440 445 Gly Arg Val His Ala Ala Arg Arg Leu Phe Glu Tyr Leu Glu Thr Asn 450 455 460 Gly Leu Asn Phe Pro Val Ile His His Ile Glu Phe Pro Lys Ser Val 465 470 475 480 Asn Arg Asp Asp Leu Val Ile Gly Ala Gly Ala Asn Val Gly Ala Leu 485 490 495 Leu Val Asp Gly Leu Gly Asp Gly Val Leu Leu Glu Ala Ala Asp Gln 500 505 510 Glu Phe Glu Phe Leu Arg Asp Thr Ser Phe Asn Leu Leu Gln Gly Cys 515 520 525 Arg Met Arg Asn Thr Lys Thr Ile Ala Ile Met Gly Cys Ile Val Asn 530 535 540 Gly Pro Gly Glu Met Ala Asp Ala Asp Phe Gly Tyr Val Gly Gly Ala 545 550 555 560 Pro Gly Lys Ile Asp Leu Tyr Val Gly Lys Thr Val Val Gln Arg Gly 565 570 575 Ile Ala Met Glu Gly Ala Thr Asp Ala Leu Ile Gln Leu Ile Lys Asp 580 585 590 His Gly Arg Trp Val Asp Pro Pro Val Glu Glu 595 600 <210> SEQ ID NO 50 <211> LENGTH: 372 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <400> SEQUENCE: 50 Met His Asn Gln Ala Pro Ile Gln Arg Arg Lys Ser Thr Arg Ile Tyr 1 5 10 15 Val Gly Asn Val Pro Ile Gly Asp Gly Ala Pro Ile Ala Val Gln Ser 20 25 30 Met Thr Asn Thr Arg Thr Thr Asp Val Glu Ala Thr Val Asn Gln Ile 35 40 45 Lys Ala Leu Glu Arg Val Gly Ala Asp Ile Val Arg Val Ser Val Pro 50 55 60 Thr Met Asp Ala Ala Glu Ala Phe Lys Leu Ile Lys Gln Gln Val Asn 65 70 75 80 Val Pro Leu Val Ala Asp Ile His Phe Asp Tyr Arg Ile Ala Leu Lys 85 90 95 Val Ala Glu Tyr Gly Val Asp Cys Leu Arg Ile Asn Pro Gly Asn Ile 100 105 110 Gly Asn Glu Glu Arg Ile Arg Met Val Val Asp Cys Ala Arg Asp Lys 115 120 125 Asn Ile Pro Ile Arg Ile Gly Val Asn Ala Gly Ser Leu Glu Lys Asp 130 135 140 Leu Gln Glu Lys Tyr Gly Glu Pro Thr Pro Gln Ala Leu Leu Glu Ser 145 150 155 160 Ala Met Arg His Val Asp His Leu Asp Arg Leu Asn Phe Asp Gln Phe 165 170 175 Lys Val Ser Val Lys Ala Ser Asp Val Phe Leu Ala Val Glu Ser Tyr 180 185 190 Arg Leu Leu Ala Lys Gln Ile Asp Gln Pro Leu His Leu Gly Ile Thr 195 200 205 Glu Ala Gly Gly Ala Arg Ser Gly Ala Val Lys Ser Ala Ile Gly Leu 210 215 220 Gly Leu Leu Leu Ser Glu Gly Ile Gly Asp Thr Leu Arg Val Ser Leu 225 230 235 240 Ala Ala Asp Pro Val Glu Glu Ile Lys Val Gly Phe Asp Ile Leu Lys 245 250 255 Ser Leu Arg Ile Arg Ser Arg Gly Ile Asn Phe Ile Ala Cys Pro Thr 260 265 270 Cys Ser Arg Gln Glu Phe Asp Val Ile Gly Thr Val Asn Ala Leu Glu 275 280 285 Gln Arg Leu Glu Asp Ile Ile Thr Pro Met Asp Val Ser Ile Ile Gly 290 295 300 Cys Val Val Asn Gly Pro Gly Glu Ala Leu Val Ser Thr Leu Gly Val 305 310 315 320 Thr Gly Gly Asn Lys Lys Ser Gly Leu Tyr Glu Asp Gly Val Arg Lys 325 330 335 Asp Arg Leu Asp Asn Asn Asp Met Ile Asp Gln Leu Glu Ala Arg Ile 340 345 350 Arg Ala Lys Ala Ser Gln Leu Asp Glu Ala Arg Arg Ile Asp Val Gln 355 360 365 Gln Val Glu Lys 370 <210> SEQ ID NO 51 <211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named CINCO <400> SEQUENCE: 51 cgctgcccag aatggacctc cctag 25 <210> SEQ ID NO 52 <211> LENGTH: 26 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named SEIS <400> SEQUENCE: 52 cagccgcgtt ttgacttgaa acgtgc 26 <210> SEQ ID NO 53 <211> LENGTH: 27 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named MPD-Nde5′ <400> SEQUENCE: 53 gccatatgac cgtttacaca gcatccg 27 <210> SEQ ID NO 54 <211> LENGTH: 35 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named MPD-Eco3′ <400> SEQUENCE: 54 tcgaattctc attattcctt tggtagacca gtctt 35 <210> SEQ ID NO 55 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named hPMK1 <400> SEQUENCE: 55 tggttaacat atggccccgc tgggaggcgc 30 <210> SEQ ID NO 56 <211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named hPMK4 <400> SEQUENCE: 56 aggttaactc aattaaagtc tggagcggat aaattctatc 40 <210> SEQ ID NO 57 <211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named UNO <400> SEQUENCE: 57 cgggcctcgt ttggctgtcg cactg 25 <210> SEQ ID NO 58 <211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named DOS <400> SEQUENCE: 58 cgcgggtgga aggaccttgt ggagg 25 <210> SEQ ID NO 59 <211> LENGTH: 33 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named MK-Hpa5′ <400> SEQUENCE: 59 aagttaacat atgtcattac cgttcttaac ttc 33 <210> SEQ ID NO 60 <211> LENGTH: 34 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named MK-Hpa3′ <400> SEQUENCE: 60 cggttaactc attatgaagt ccatggtaaa ttcg 34 <210> SEQ ID NO 61 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named idi5X <400> SEQUENCE: 61 cccctcgaga ttatgcaaac ggaacacgtc 30 <210> SEQ ID NO 62 <211> LENGTH: 31 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named idi3X <400> SEQUENCE: 62 ggctcgagtt atttaagctg ggtaaatgca g 31 <210> SEQ ID NO 63 <211> LENGTH: 32 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named pBAD-mut1 <400> SEQUENCE: 63 ctgagagtgc accatctgcg gtgtgaaata cc 32 <210> SEQ ID NO 64 <211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named pBAD-Link1 <400> SEQUENCE: 64 aattctaagg aggtttaaac taaggaggta cgtaaggagg 40 <210> SEQ ID NO 65 <211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named pBAD-Link2 <400> SEQUENCE: 65 tcgacctcct tacgtacctc cttagtttaa acctccttag 40 <210> SEQ ID NO 66 <211> LENGTH: 21 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named pBAD-D2 <400> SEQUENCE: 66 tcatactccc gccattcaga g 21 <210> SEQ ID NO 67 <211> LENGTH: 21 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named pBAD-U3 <400> SEQUENCE: 67 ccgccaaaac agccaagctt g 21 <210> SEQ ID NO 68 <211> LENGTH: 28 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named pRS-L1 <400> SEQUENCE: 68 gatccgttta aacgcccggg cggccgcg 28 <210> SEQ ID NO 69 <211> LENGTH: 28 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named pRS-L2 <400> SEQUENCE: 69 aattcgcggc cgcccgggcg tttaaacg 28 <210> SEQ ID NO 70 <211> LENGTH: 22 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named 1PE <400> SEQUENCE: 70 cgcggtgtgg gtgagcatga tg 22 <210> SEQ ID NO 71 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named 22PE <400> SEQUENCE: 71 aaatctcccg ggttacccgt ctgttactgc 30 <210> SEQ ID NO 72 <211> LENGTH: 33 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named 3PE <400> SEQUENCE: 72 gcgtttaaac tggacgaagc gcgtcgaatt gac 33 <210> SEQ ID NO 73 <211> LENGTH: 22 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named 4PE <400> SEQUENCE: 73 tgcacgaccg cccagttgtt cc 22 <210> SEQ ID NO 74 <211> LENGTH: 21 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named CAT1 <400> SEQUENCE: 74 gagtccgaat aaatacctgt g 21 <210> SEQ ID NO 75 <211> LENGTH: 21 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named CAT4 <400> SEQUENCE: 75 ccgaatttct gccattcatc c 21 <210> SEQ ID NO 76 <211> LENGTH: 21 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named 0PE <400> SEQUENCE: 76 tgggctttgt cacgagcaca c 21 <210> SEQ ID NO 77 <211> LENGTH: 21 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Designed primer named 5PE <400> SEQUENCE: 77 ggcccatagc aaaaccgaca g 21 <210> SEQ ID NO 78 <211> LENGTH: 372 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <400> SEQUENCE: 78 Met His Asn Gln Ala Pro Ile Gln Arg Arg Lys Ser Thr Arg Ile Tyr 1 5 10 15 Val Gly Asn Val Pro Ile Gly Asp Gly Ala Pro Ile Ala Val Gln Ser 20 25 30 Met Thr Asn Thr Arg Thr Thr Asp Val Glu Ala Thr Val Asn Gln Ile 35 40 45 Lys Ala Leu Glu Arg Val Gly Ala Asp Ile Val Arg Val Ser Val Pro 50 55 60 Thr Met Asp Ala Ala Glu Ala Phe Lys Leu Ile Lys Gln Gln Val Asn 65 70 75 80 Val Pro Leu Val Ala Asp Ile His Phe Asp Tyr Arg Ile Ala Leu Lys 85 90 95 Val Ala Glu Tyr Gly Val Asp Cys Leu Arg Ile Asn Pro Gly Asn Ile 100 105 110 Gly Asn Glu Glu Arg Ile Arg Met Val Val Asp Cys Ala Arg Asp Lys 115 120 125 Asn Ile Pro Ile Arg Ile Gly Val Asn Ala Gly Ser Leu Glu Lys Asp 130 135 140 Leu Gln Glu Lys Tyr Gly Glu Pro Thr Pro Gln Ala Leu Leu Glu Ser 145 150 155 160 Ala Met Arg His Val Asp His Leu Asp Arg Leu Asn Phe Asp Gln Phe 165 170 175 Lys Val Ser Val Lys Ala Ser Asp Val Phe Leu Ala Val Glu Ser Tyr 180 185 190 Arg Leu Leu Ala Lys Gln Ile Asp Gln Pro Leu His Leu Gly Ile Thr 195 200 205 Glu Ala Gly Gly Ala Arg Ser Gly Ala Val Lys Ser Ala Ile Gly Leu 210 215 220 Gly Leu Leu Leu Ser Glu Gly Ile Gly Asp Thr Leu Arg Val Ser Leu 225 230 235 240 Ala Ala Asp Pro Val Glu Glu Ile Lys Val Gly Phe Asp Ile Leu Lys 245 250 255 Ser Leu Arg Ile Arg Ser Arg Gly Ile Asn Phe Ile Ala Cys Pro Thr 260 265 270 Cys Ser Arg Gln Glu Phe Asp Val Ile Gly Thr Val Asn Ala Leu Glu 275 280 285 Gln Arg Leu Glu Asp Ile Ile Thr Pro Met Asp Val Ser Ile Ile Gly 290 295 300 Cys Val Val Asn Gly Pro Gly Glu Ala Leu Val Ser Thr Leu Gly Val 305 310 315 320 Thr Gly Gly Asn Lys Lys Ser Gly Leu Tyr Glu Asp Gly Val Arg Lys 325 330 335 Asp Arg Leu Asp Asn Asn Asp Met Ile Asp Gln Leu Glu Ala Arg Ile 340 345 350 Arg Ala Lys Ala Ser Gln Leu Asp Glu Ala Arg Arg Ile Asp Val Gln 355 360 365 Gln Val Glu Lys 370 <210> SEQ ID NO 79 <211> LENGTH: 740 <212> TYPE: PRT <213> ORGANISM: Arabidopsis thaliana <400> SEQUENCE: 79 Met Ala Thr Gly Val Leu Pro Ala Pro Val Ser Gly Ile Lys Ile Pro 1 5 10 15 Asp Ser Lys Val Gly Phe Gly Lys Ser Met Asn Leu Val Arg Ile Cys 20 25 30 Asp Val Arg Ser Leu Arg Ser Ala Arg Arg Arg Val Ser Val Ile Arg 35 40 45 Asn Ser Asn Gln Gly Ser Asp Leu Ala Glu Leu Gln Pro Ala Ser Glu 50 55 60 Gly Ser Pro Leu Leu Val Pro Arg Gln Lys Tyr Cys Glu Ser Leu His 65 70 75 80 Lys Thr Val Arg Arg Lys Thr Arg Thr Val Met Val Gly Asn Val Ala 85 90 95 Leu Gly Ser Glu His Pro Ile Arg Ile Gln Thr Met Thr Thr Ser Asp 100 105 110 Thr Lys Asp Ile Thr Gly Thr Val Asp Glu Val Met Arg Ile Ala Asp 115 120 125 Lys Gly Ala Asp Ile Val Arg Ile Thr Val Gln Gly Lys Lys Glu Ala 130 135 140 Asp Ala Cys Phe Glu Ile Lys Asp Lys Leu Val Gln Leu Asn Tyr Asn 145 150 155 160 Ile Pro Leu Val Ala Asp Ile His Phe Ala Pro Thr Val Ala Leu Arg 165 170 175 Val Ala Glu Cys Phe Asp Lys Ile Arg Val Asn Pro Gly Asn Phe Ala 180 185 190 Asp Arg Arg Ala Gln Phe Glu Thr Ile Asp Tyr Thr Glu Asp Glu Tyr 195 200 205 Gln Lys Glu Leu Gln His Ile Glu Gln Val Phe Thr Pro Leu Val Glu 210 215 220 Lys Cys Lys Lys Tyr Gly Arg Ala Met Arg Ile Gly Thr Asn His Gly 225 230 235 240 Ser Leu Ser Asp Arg Ile Met Ser Tyr Tyr Gly Asp Ser Pro Arg Gly 245 250 255 Met Val Glu Ser Ala Phe Glu Phe Ala Arg Ile Cys Arg Lys Leu Asp 260 265 270 Tyr His Asn Phe Val Phe Ser Met Lys Ala Ser Asn Pro Val Ile Met 275 280 285 Val Gln Ala Tyr Arg Leu Leu Val Ala Glu Met Tyr Val His Gly Trp 290 295 300 Asp Tyr Pro Leu His Leu Gly Val Thr Glu Ala Gly Glu Gly Glu Asp 305 310 315 320 Gly Arg Met Lys Ser Ala Ile Gly Ile Gly Thr Leu Leu Gln Asp Gly 325 330 335 Leu Gly Asp Thr Ile Arg Val Ser Leu Thr Glu Pro Pro Glu Glu Glu 340 345 350 Ile Asp Pro Cys Arg Arg Leu Ala Asn Leu Gly Thr Lys Ala Ala Lys 355 360 365 Leu Gln Gln Gly Ala Pro Phe Glu Glu Lys His Arg His Tyr Phe Asp 370 375 380 Phe Gln Arg Arg Thr Gly Asp Leu Pro Val Gln Lys Glu Gly Glu Glu 385 390 395 400 Val Asp Tyr Arg Asn Val Leu His Arg Asp Gly Ser Val Leu Met Ser 405 410 415 Ile Ser Leu Asp Gln Leu Lys Ala Pro Glu Leu Leu Tyr Arg Ser Leu 420 425 430 Ala Thr Lys Leu Val Val Gly Met Pro Phe Lys Asp Leu Ala Thr Val 435 440 445 Asp Ser Ile Leu Leu Arg Glu Leu Pro Pro Val Asp Asp Gln Val Ala 450 455 460 Arg Leu Ala Leu Lys Arg Leu Ile Asp Val Ser Met Gly Val Ile Ala 465 470 475 480 Pro Leu Ser Glu Gln Leu Thr Lys Pro Leu Pro Asn Ala Met Val Leu 485 490 495 Val Asn Leu Lys Glu Leu Ser Gly Gly Ala Tyr Lys Leu Leu Pro Glu 500 505 510 Gly Thr Arg Leu Val Val Ser Leu Arg Gly Asp Glu Pro Tyr Glu Glu 515 520 525 Leu Glu Ile Leu Lys Asn Ile Asp Ala Thr Met Ile Leu His Asp Val 530 535 540 Pro Phe Thr Glu Asp Lys Val Ser Arg Val His Ala Ala Arg Arg Leu 545 550 555 560 Phe Glu Phe Leu Ser Glu Asn Ser Val Asn Phe Pro Val Ile His His 565 570 575 Ile Asn Phe Pro Thr Gly Ile His Arg Asp Glu Leu Val Ile His Ala 580 585 590 Gly Thr Tyr Ala Gly Gly Leu Leu Val Asp Gly Leu Gly Asp Gly Val 595 600 605 Met Leu Glu Ala Pro Asp Gln Asp Phe Asp Phe Leu Arg Asn Thr Ser 610 615 620 Phe Asn Leu Leu Gln Gly Cys Arg Met Arg Asn Thr Lys Thr Glu Tyr 625 630 635 640 Val Ser Cys Pro Ser Cys Gly Arg Thr Leu Phe Asp Leu Gln Glu Ile 645 650 655 Ser Ala Glu Ile Arg Glu Lys Thr Ser His Leu Pro Gly Val Ser Ile 660 665 670 Ala Ile Met Gly Cys Ile Val Asn Gly Pro Gly Glu Met Ala Asp Ala 675 680 685 Asp Phe Gly Tyr Val Gly Gly Ser Pro Gly Lys Ile Asp Leu Tyr Val 690 695 700 Gly Lys Thr Val Val Lys Arg Gly Ile Ala Met Thr Glu Ala Thr Asp 705 710 715 720 Ala Leu Ile Gly Leu Ile Lys Glu His Gly Arg Trp Val Asp Pro Pro 725 730 735 Val Ala Asp Glu 740 <210> SEQ ID NO 80 <211> LENGTH: 155 <212> TYPE: DNA <213> ORGANISM: Arabidopsis thaliana <400> SEQUENCE: 80 aaaaatcgga aaaatggcga ctggagtatt gccagctccg gtttctggga tcaagatacc 60 ggattcgaaa gtcgggtttg gtaaaagcat gaatcttgtg agaatttgtg atgttaggag 120 tctaagatct gctgatgagt agatttcata aaagt 155 <210> SEQ ID NO 81 <211> LENGTH: 42 <212> TYPE: PRT <213> ORGANISM: Arabidopsis thaliana <400> SEQUENCE: 81 Met Ala Thr Gly Val Leu Pro Ala Pro Val Ser Gly Ile Lys Ile Pro 1 5 10 15 Asp Ser Lys Val Gly Phe Gly Lys Ser Met Asn Leu Val Arg Ile Cys 20 25 30 Asp Val Arg Ser Leu Arg Ser Ala Asp Glu 35 40 <210> SEQ ID NO 82 <211> LENGTH: 45 <212> TYPE: DNA <213> ORGANISM: Arabidopsis thaliana <400> SEQUENCE: 82 atgagaggat cgcaycayca ycaycaycay cayggatccg catgc 45 <210> SEQ ID NO 83 <211> LENGTH: 12 <212> TYPE: PRT <213> ORGANISM: Arabidopsis thaliana <400> SEQUENCE: 83 Met Arg Gly Ser His His His His His His Gly Ser 1 5 10 <210> SEQ ID NO 84 <211> LENGTH: 59 <212> TYPE: DNA <213> ORGANISM: Arabidopsis thaliana <400> SEQUENCE: 84 atgagaggat cgcaycayca ycaycaycay ggatctgctg atgagtagat ttcgcatgc 59 <210> SEQ ID NO 85 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Arabidopsis thaliana <400> SEQUENCE: 85 Met Arg Gly Ser His His His His His His Gly Ser Ala Asp Glu 1 5 10 15 

What is claimed is:
 1. A transgenic cotton plant or seed, cells or tissues thereof comprising event EE-GH1 in its genome.
 2. The transgenic cotton plant, or seed, cells or tissues of claim 1, the genomic DNA of which, when analyzed using the Elite event identification protocol for EE-GH1 with two primers comprising the nucleotide sequence of SEQ ID NO: 2 and SEQ ID NO: 3 respectively, yields a DNA fragment of between 250 and 290 bp.
 3. The cotton plant, or seed, cells or tissues thereof, according to claim 2, wherein said DNA fragment is a fragment of about 269 bp.
 4. A cotton plant, or seed, cells or tissues thereof, obtained by propagation of and/or breeding with a cotton plant grown from the seed deposited at the ATCC under accession number PTA-3343.
 5. A cotton plant, seed, cells or tissues thereof which is the progeny of the seed deposited at the ATCC under accession number PTA-3343
 6. A method for identifying elite event EE-GH1 in biological samples, which method comprises detecting an EE-GH1 specific region with a primer or probe which specifically recognizes the 5′ flanking region of SEQ ID NO: 3 or the 3′ flanking region of SEQ ID NO: 4 of EE-GH1.
 7. The method of claim 6, said method comprising amplifying a DNA fragment of between 100 and 350 bp from a nucleic acid present in said biological samples using a polymerase chain reaction with at least two primers, one of which recognizes the 5′ or 3′ flanking region of EE-GH1 and the other which recognizes a sequence within the foreign DNA of EE-GH1.
 8. The method of claim 7, wherein said one primer recognizes a sequence within the 5′ flanking region of SEQ ID NO: 3 and said other primer recognizes a sequence within the foreign DNA of EE-GH1.
 9. The method of claim 8, wherein said primer recognizing a sequence within the 5′ flanking region of EE-GH1 comprises the sequence of SEQ ID NO:
 2. 10. The method of any one of claims 6 to 9, wherein said primer recognizing a sequence within the foreign DNA comprises the sequence of SEQ ID NO:
 1. 11. A method for identifying EE-GH1 in a biological sample, which method comprises detecting an EE-GH1 specific region with a specific primer or probe which hybridizes under stringent conditions to a sequence within the 5′ of SEQ ID NO: 3 or within the 3′ flanking sequence of SEQ ID NO: 4 of EE-GH1.
 12. A method for identifying a transgenic plant, or cells or tissues thereof, comprising the elite event EE-GH1, which method comprises establishing that genomic DNA can be used, according to a PCR identification protocol, to amplify a DNA fragment of between 250 and 290 bp, using a polymerase chain reaction with two primers having the nucleotide sequence of SEQ ID NO: 1 and SEQ ID NO: 2, respectively.
 13. A kit for identifying elite event EE-GH1 in biological samples, said kit comprising at least one PCR primer or probe, which recognizes a sequence within the 5′ flanking region of SEQ ID NO: 3 or the 3′ flanking region of SEQ ID NO: 4 of EE-GH1.
 14. The kit of claim 13, wherein said at least one PCR primer recognizes a sequence within the plant DNA in SEQ ID NO:
 3. 15. The kit of claim 14, wherein said primer recognizing a sequence within the plant DNA in SEQ ID NO: 3 comprises the sequence of SEQ ID NO:
 2. 16. The kit of claims 13 to 15, which further comprises at least a second PCR primer or probe which recognizes a sequence within the foreign DNA of EE-GH1.
 17. The kit of claim 16, wherein said primer recognizing a sequence within the foreign DNA of EE-GH1 comprises the sequence of SEQ ID NO:
 1. 18. A method for confirming seed purity, which method comprises detecting an EE-GH1 specific DNA sequence with a specific primer or probe which specifically recognizes a sequence within the 5′ flanking region of SEQ ID NO: 3 or the 3′ flanking region of SEQ ID NO: 4 of EE-GH1, in seed samples.
 19. A method for screening seeds for the presence of EE-GH1, which method comprises detecting an EE-GH1 specific DNA sequence with a specific primer or probe which specifically recognizes a sequence within the 5′ flanking region of SEQ ID NO: 3 or the 3′ flanking region of SEQ ID NO: 4 of EE-GH1, in samples of seed lots.
 20. A seed deposited at the ATCC under accession number PTA-3343.
 21. A cotton seed comprising elite event EE-GH1, reference seed comprising said event having been deposited at the ATCC under accession number PTA-3343.
 22. A cotton plant, cell or tissue or plant material thereof comprising elite event EE-GH1, derived from the seed of claim
 21. 23. Transgenic cotton plants, seeds, cells or tissues, the genomic DNA of which comprises a transgene integrated into the chromosomal DNA in a region which comprises a sequence of at least 40 bp which hybridizes under stringent conditions with a sequence which is complementary to the sequence of SEQ ID NO:
 5. 24. A process for producing a transgenic cotton plant or cell or tissue of a cotton plant, said process comprising introducing a recombinant DNA molecule into a region of cotton chromosomal DNA corresponding to a sequence of at least 40 bp that hybridizes under stringent conditions with a sequence that is complementary to the sequence of SEQ ID NO: 5, and, optionally, regenerating a cotton plant from the transformed cotton cell or tissue.
 25. The process of claim 24, wherein said recombinant DNA molecule comprises an herbicide resistance gene.
 26. The plant or cell or tissue of a cotton plant obtained by the process of claims 24 or
 25. 27. A transgenic cotton plant or seed, cells or tissues thereof comprising (i) event EE-GH1 in its genome; or (ii) event EE-GH1 with the proviso that the bar gene used in the event is substituted with a nucleic acid sequence that hybridizes to the complement of the bar gene under stringent conditions. 